The past twenty-five years have
witnessed a growing interest on the
part of psychologists, geneticists,
and demographers in the relationship
between intellectual level and
family size. As a result of the negative
correlations commonly found between
intelligence test scores of children
and number of siblings, several
writers have predicted a drop in the
intellectual level of the population.
Data which have recently become
available from a variety of sources,
however, raise serious doubts regarding
such a conclusion. Careful analyses
of the problem, moreover, have
revealed that it is far more complex,
both methodologically and theoretically,
than was originally supposed.


THE GENERAL QUESTION OF
DIFFERENTIAL FERTILITY 


In current English demographic 
usage, the term “fecundity” signifies 
capacity to produce living offspring, 
whereas “fertility” refers to actual
performance (60). Some investigators
have differentiated further
among number of pregnancies, number
of births (including stillbirths),
number of children born alive, and
number of children alive at the time
of the survey (71). For practical reasons,
however, most studies have
been concerned only with the last-named
category. To be precise, one
should also identify foster children
and stepchildren within each family,
but not all studies have done so. The
number of such cases, of course, is
relatively small and would not materially
affect the conclusions.

Mention should also be made of 
the frequently reported “crude birth
rate,” which is simply the number
of births during a year, per thousand 
population. Such a figure reflects 
not only mean size of family but also 
other demographic characteristics, 
such as the age distribution, sex 
ratio, and marriage habits of the 
population. Adjusted birth rates are 
sometimes computed, in which one 
or more of these characteristics are 
controlled. On the other hand, when 
average size of sibship is considered, 
as in most studies on fertility and
intelligence, no account is taken of 
childless families, proportion of unmarried
persons, and differential mortality
rates. It is thus apparent that
fertility statistics can be variously 
expressed and that their specific im- 
plications need to be carefully scru- 
tinized. 
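
As a rough illustration of the definition given above, the crude birth rate can be computed as in the sketch below. The function name and the figures in the usage line are hypothetical, chosen only to show the arithmetic; they are not taken from any of the surveys cited.

```python
def crude_birth_rate(births_in_year: int, population: int) -> float:
    """Number of births during a year, per thousand population."""
    if population <= 0:
        raise ValueError("population must be positive")
    return 1000.0 * births_in_year / population


# Hypothetical figures: 4,000 births in a population of 100,000.
example_rate = crude_birth_rate(4_000, 100_000)
print(example_rate)  # 40.0
```

Because the denominator is the whole population, the figure shifts with age distribution, sex ratio, and marriage habits even when mean family size is unchanged, which is why adjusted rates are sometimes preferred.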

Most demographers agree that, al- 
though capacity for procreation may 
be influenced by genetic factors and 
by such environmental conditions as 
nutrition and climate, variations in 
fertility among existing human populations
arise chiefly from social and
psychological factors (60, 96). Simi- 
lar explanations have been offered for 
progressive changes in fertility within 
given populations. In the United
States and in most European countries,
the birth rate declined steadily
from the early or late nineteenth 
century to the 1930's, reaching its 
lowest point during or immediately 
following the economic depression. 
This decline has usually been attrib- 
uted to socioeconomic factors, such 
as urban migration, the rising stand- 
ard of living, and the increasing ex- 
pense of rearing children (96). Fol- 
lowing World War II, the trend 
was reversed, the crude birth rate for 
these countries rising markedly and
reaching a peak in the late 1940's. 
Much of this rise is believed to be 
due to the abnormally large number 
of marriages during and immediately 
after World War II, as well as to 
births which had been postponed 
during the depression or early part 
of the war, or which were advanced 
from later years because of the economic
prosperity (96). Some demographers,
however, see evidence of a
genuine increase in size of completed 
families (96). 

Differential fertility in various sub- 
divisions of a population was recorded 
as early as the seventeenth century 
in Europe (96). In a survey of British 
demographic writings appearing be- 
tween 1660 and 1760, for example, 
Kuczynski (52) found references to
the greater fertility of rural as com- 
pared with urban dwellers, and of the 
poor as compared with the wealthy. 
The data upon which these early 
opinions were based are, of course, 
meager and difficult to interpret. In 
more recent times, however, a mass 
of data has accumulated which shows 
an inverse relationship between fam- 
ily size and such variables as income, 
occupational level, and amount of 
education (47, 61, 84, 96). The pre- 
viously mentioned decline in birth 
rate which began in the nineteenth 
century was more rapid in the upper 
than in the lower socioeconomic and
educational classes, thus either producing
or augmenting such fertility
differentials. 

Certain other findings regarding 
fertility differentials are of special 
interest. One of the most persistent 
differentials, occurring even when 
other subgroup differences in family 
size are absent or reversed, is that 
between urban and rural groups. 
This difference usually increases when 
the comparison is between agricul- 
tural and nonagricultural occupa- 
tions. The largest families are thus 
found among those rural residents 
who are engaged in agriculture (96). 
Another suggestive finding, reported 
in an American survey, is that family 
size tended to be more closely related 
to the educational level of the wife 
than to that of the husband (5). 
Finally, it is noteworthy that a com- 
parison of native whites and Negroes 
in the United States reveals that the 
fertility rates of both groups are af- 
fected in the same way by such fac- 
tors as socioeconomic status, educa- 
tion, and urban-rural residence (53). 

All of the above socioeconomic 
and educational differences in fer- 
tility have been interpreted as in- 
direct evidence for a relationship be- 
tween fertility and intelligence. Inso- 
far as occupational and educational 
level are positively correlated with 
intelligence test scores, and urban 
scores average higher than rural, the 
inverse relation of these variables 
with family size suggests a similar 
inverse relation between intelligence 
and family size. Even prior to the 
direct study of the latter relation- 
ship, therefore, its implications had 
been considered by geneticists and 
demographers. 

There are, however, a number of 
noteworthy exceptions to the rela- 
tionships cited above. For example, 
several local studies in American, 
Canadian, French, German, Norwegian,
and Swedish cities have
shown that the negative correlation 
between income and family size pre- 
vails only up to a certain income level 
(96). Moreover, within the higher 
income groups, there is sometimes a 
tendency for family size to be posi- 
tively correlated with income. Simi- 
larly, exceptions can be found to the 
hypothesized relation between fer- 
tility and occupational level (30). 

Special mention should be made of 
the Indianapolis survey, conducted 
in 1941-42 (48, 49, 50, 51, 106, 107). 
This study began with a short ques- 
tionnaire administered to approxi- 
mately 41,500 native, white couples 
living in Indianapolis, in which the 
wife was under 45 years of age and
neither spouse had previous marriages.
The major data were gathered
in an intensive interview survey of
about 1,500 women selected from the
larger sample because they met certain
additional specifications. The
investigation was designed to test a
number of hypotheses regarding social
and psychological factors affecting
fertility. Many of the findings
are difficult to interpret, however,
because of the complex interrelationships
and interactions of the variables
concerned (cf. 48, 51). Nevertheless,
one observation is of special
interest in the present connection. 
Although the usual inverse relation- 
ship between socioeconomic indices 
and fertility was found over a large 
part of the range, this relationship 
changed from negative to positive 
within the highest income levels. 
Such a result held in both the total 
sample and the subsample chosen for 
intensive study. It was likewise 
found to be true in another subsam- 
ple including only wives aged 40 to 
44 years, who were more likely to 
have completed families at the time 
of the survey (50). 

One of the hypotheses proposed to
account for such results is based upon
the relation of upward social mobility 
to family size (104). There is some 
direct evidence in support of this 
hypothesis. Baltzell (6), for example, 
found larger families among residents 
of Philadelphia listed in both the 
1940 Who's Who and the Social Reg- 
ister than among those listed only in 
Who's Who. He argued that the lat- 
ter had more often achieved their 
socially prominent position through 
their own efforts rather than having 
occupied it throughout life. They 
had thus shown more upward mo- 
bility than the Social Register group. 
Other analyses conducted within the
same study, involving persons with 
a private school education and those 
whose families had been listed in 
Who's Who since 1900, likewise indicated
greater fertility among the socially
less mobile groups. Attempts
to test the hypothesis among intermediate
socioeconomic groups have
yielded inconsistent results. At such
levels, it is difficult to find groups in 
which the drive for upward social 
mobility can be assumed to be negli- 
gible. For example, persons whose oc- 
cupational levels are higher than 
those of their fathers or who have ad- 
vanced in occupational level within 
their own vocational history have 
been compared with those who failed 
to show such changes (7). Even the 
“control” group which has mani- 
fested no occupational changes of 
this type, however, may have manifested
upward mobility in other ways.

It is apparent from various sources 
that, even prior to the postwar “baby
boom,” generalizations regarding the 
inverse relationship of fertility to 
socioeconomic factors needed qualifi- 
cation. Following World War II, 
fertility increased relatively more 
among those groups in which the pre- 
war level had been low, i.e., among 
urban residents and persons in higher
educational, occupational, and income
levels. Thus the increasing
birth rate has had the effect of nar- 
rowing fertility differentials within 
the population. These changes are 
vividly illustrated by some of the 
data gathered in recent sample sur- 
veys by the United States Bureau of 
the Census (98, 99). For example, 
when married women between the 
ages of 15 and 44 were considered 
in the 1952 survey, no relationship 
was found between income level and 
total number of children, except for 
a proportionately large number of 
children in the lowest income group 
(under $1,000). When the same 
women were classified according to 
husband's occupation, some reversals 
in the usual fertility differentials 
were observed. Thus, professional, 
semiprofessional, and managerial 
groups had proportionately more
children than did certain lower occupational
groups, such as clerical and
sales workers.¹ Fertility differentials
among educational groups have also 
decreased (98). The remaining dif- 
ferences between educational classes 
are still large, however, as are those 
between urban and rural groups. 

A similar narrowing of fertility dif- 
ferentials has been reported for a 
number of European countries (cf. 
16, 96). For example, there is evi- 
dence of an actual increase in family 
size in the upper social levels of Swe- 
den and Great Britain. Moreover, 
the decrease in fertility differentials 
holds for comparisons between countries
as well. Thus it is largely in the
low-fertility countries that the birth
rate has increased.

¹ The fact that significant differences in
family size were obtained among occupational
groups, while no such differences were found
among income levels within the corresponding
range, reflects the low correlation between income
and occupational level. The range of
income within each of the major occupational
categories is very wide, and the income differential
between such occupational groups has
been declining for many years.

Mention may also be made of con- 
ditions prevailing within high-fer- 
tility areas. These are identified as 
regions having a crude birth rate in 
the vicinity of 40 per thousand, or 
higher (96). Such birth rates have 
been reported among peoples of 
Africa, Asia, and Latin-America; 
among non-European peoples of 
Oceania; and in certain parts of Eu- 
rope. For most of these groups, demo- 
graphic data are very inadequate. 
In general, the birth rate in these 
areas has shown no upward or down- 
ward trend, unlike the birth rate in 
low-fertility areas. Moreover, the 
usual fertility differentials have not 
been observed in high-fertility areas. 
In Brazil, for example, the urban- 
rural differential is the only one which 
follows the familiar pattern. Reported
socioeconomic differentials in
that country are slight and not in the 
expected direction. Thus within each 
type of work, the fertility of employ- 
ers and own-account workers is gen- 
erally higher than that of employees 
and unpaid family workers (96). The 
data from high-fertility areas, however
meager, provide another major
exception to the negative correlation
between socioeconomic level and
family size.


STUDIES OF INTELLIGENCE TEST 
PERFORMANCE IN RELATION 
TO FAMILY SIZE 


While much has been inferred 
about the relationship between fer- 
tility and intelligence from studies on 
socioeconomic factors, a direct ap- 
proach to this problem has also been 
utilized. Current concern with this 
question on the part of demographers 
is illustrated by the inclusion of a session
on “methods of research on relations
between intelligence and fertility”
in the World Population Conference
held under United Nations
auspices in Rome in 1954 (97). Simi- 
larly, a special working group was 
called together by Unesco to dis- 
cuss the implications of fertility dif- 
ferentials for the intellectual develop- 
ment of the population. This group 
met in Paris early in 1954 in order to 
draw up a statement which was con- 
tributed by Unesco to the World 
Population Conference (17). 

Investigations on the relationship 
between family size and intelligence
test score have been conducted in
several countries, although the liveliest
interest in this problem has been
manifested in Great Britain. Among
such studies, the outstanding exam- 
ples are the Scottish surveys (81, 82, 
83). In each of these surveys, an ef- 
fort was made to test a complete age 
sample. In 1932 and again in 1947, a 
45-minute group intelligence test 
including verbal and pictorial material
was administered to all 11-year-old
children in Scotland. The samples
actually tested consisted of
87,498 and 70,805 children in the 
first and second surveys, respectively. 
These samples are described as com- 
plete except for the children whose 
sensory or motor handicaps precluded 
the valid administration of this test, 
those school children who were absent 
on the day of testing, and a few chil- 
dren attending certain private schools 
for whom the necessary background 
data could not be obtained. In terms 
of the total population of 11-year-old 
Scottish children, the two samples af- 
forded approximately the same cov- 
erage. Since the estimated number 
of 11-year-old children in Scotland 
was 100,300 in 1932 and 80,300 in 
1947, the percentages covered in the 
two samples were 87 and 88, respec- 
tively. 
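
The coverage percentages quoted above follow directly from the sample sizes and the population estimates given in the text. The short sketch below reproduces that arithmetic; the function and variable names are illustrative only.

```python
def coverage_percent(tested: int, estimated_population: int) -> int:
    """Share of the estimated population actually tested, as a whole per cent."""
    return round(100 * tested / estimated_population)


# (children tested, estimated number of 11-year-olds), as reported in the text.
surveys = {1932: (87_498, 100_300), 1947: (70_805, 80_300)}
coverage = {year: coverage_percent(t, n) for year, (t, n) in surveys.items()}
print(coverage)  # {1932: 87, 1947: 88}
```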

Surveys with individual tests were 
likewise conducted on more narrowly 
defined samples. All children born in
Scotland on the first day of February,
May, August, or November, 1926 
were given the 1916 Stanford-Binet 
and a series of eight performance 
tests (63). This group comprised 
874 cases, tested over a three-year 
period at ages ranging from 8-11 to 
11-9. In connection with the 1947 
group test survey, 1,215 children were 
also tested with Form L of the 1937 
Stanford-Binet, as modified for Scot- 
tish children. This group, known as 
the “six-day sample,” included children
born in Scotland on the first
day of the even months in 1936. The 
coverage of the individual test sur- 
veys is more nearly complete than 
that of the group test surveys, special 
efforts having been made to reach 
every child who met the birthday 
specifications. Moreover, the 1,215 
children constituting the “six-day
sample” of the 1947 survey were studied
more intensively in terms of
background characteristics. The 
same children are currently being fol- 
lowed up to determine the effect of 
family size and other factors upon 
their subsequent development (66). 
In the earlier Scottish surveys, no 
data on family size were obtained, al- 
though the results of these surveys 
were employed in the follow-up stud- 
ies to be discussed in a later section 
of this paper. In the 1947 survey, the 
group test² yielded a correlation of
-.28 with size of sibship, and the
Stanford-Binet a correlation of -.32.
Comparisons of mean scores likewise 
showed a consistent and significant 
drop with increase in family size. 
Thus the mean Stanford-Binet IQ 
dropped steadily from 113 for “only” 
children to 91 for children with five 
sibs (82). On the group test, the
mean decrease in score per unit increase
in family size was .13σ (83).

² Only the verbal part of the group test
was included in this analysis. The results obtained
with the pictorial part proved to be too
highly skewed because of low test ceiling;
hence they have not been analyzed further.

The tendency for children from 
larger families to score lower persisted
within occupational classes
(83), a finding which has also been
reported by other investigators (14, 
21). All such studies, however, have 
employed fairly broad occupational 
categories. Thus it cannot be as- 
sumed that socioeconomic differences 
have been ruled out when compari- 
sons are made within single occupa- 
tional levels. It is also interesting to 
note that, in the highest occupational 
levels, the negative correlation be- 
tween intelligence and family size 
may disappear or be replaced by a 
positive correlation. Cattell (21) and 
Moshinsky (68) report that the high- 
est negative correlations are found 
in occupations of intermediate social
status. In this connection, reference
may again be made to the
factor of upward social mobility, discussed
in the preceding section. It
is at the intermediate social levels
that such a factor would be most 
likely to operate. 

In the 1947 Scottish survey, group 
test scores were available for 974 
twins (83). Analysis of these scores 
corroborated the usual finding that 
twins score lower than singletons, 
the difference in this group corre- 
sponding to approximately 5 IQ 
points. It was further demonstrated 
that such a difference could not be 
attributed to family size or to socio- 
economic level. There was no tend- 
ency for the twins to come from 
larger families, when number of 
births rather than number of children
was considered. Such socioeconomic
indices as relative size of
home and paternal occupation indi- 
cated no inferiority of the twin sam- 
ple. In fact, there was an excess of 
twins in the professional and employer
groups, and a deficit in the
unskilled labor group. The implications
of these findings will be considered
below, after other pertinent
data have been examined. 

Another extensive survey is that 
conducted among 6- to 12-year-old 
French school children in 1943-44 
(36, 38). This survey included 
95,237 boys and girls, approximately 
a 2 per cent sample of the elementary 
school population of France. Care 
was taken to obtain a representative 
distribution of cases over the twenty 
regions of France. All subjects were 
given René Gille’s “test Mosaique,” 
a specially constructed pictorial group 
test of intelligence.³ When analyzed
in respect to family size, the mean
test scores exhibited a consistent
drop with increasing size of sibship.
This differential was apparent within
each year group, the difference in
mean score between a one-child
family and a family of eight or more 
being equivalent to 1 or 2 years of 
mental age. Within occupational 
groups, the negative relationship be- 
tween intelligence and family size 
was clearly apparent among farmers, 
manual laborers, and clerical work- 
ers; it was barely discernible in the 
managerial group and negligible in 
the professional class. 

A special study was concerned 
with the length of intersibling inter- 
val (36, Ch. 3; 90). Within the total 
sample covered by the French sur- 
vey, there were 1,244 two-sibling 
families in which both siblings had 
been tested. These were separated 
into “long interval” and “short interval”
sibships, the latter being
defined as those falling at or below
the median interval. In the urban
group the median interval was 24
months, in the rural 23 months. On
the intelligence test, the children
with long intersib intervals obtained
significantly higher means, these
differences persisting within each of
the five occupational categories into
which the sample was subdivided.
With long intersib intervals, the
scores approximated those of only
children; with short intersib intervals,
they approximated the scores
obtained by three-child sibships.

³ This test is described and completely
reproduced in the first report of the French
survey (38, pp. 75-96). Data on reliability
and validity are included in the second report
(36, pp. 21-45). It was called the “mosaic”
test because it includes a wide variety of
items.
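
The division of two-sibling families into short-interval and long-interval groups can be sketched as a simple median split. The intervals below are hypothetical and serve only to show the rule stated in the text, under which a sibship falling at or below the median counts as short-interval; the function name is likewise an assumption.

```python
from statistics import median


def split_by_median(intervals_months):
    """Split intersib intervals at the median.

    Returns (cutoff, short, long_): 'short' holds intervals at or below
    the median, 'long_' holds those above it.
    """
    cutoff = median(intervals_months)
    short = [m for m in intervals_months if m <= cutoff]
    long_ = [m for m in intervals_months if m > cutoff]
    return cutoff, short, long_


# Hypothetical intersib intervals, in months, for six two-sibling families.
sample = [12, 18, 24, 24, 30, 48]
cutoff, short, long_ = split_by_median(sample)
print(cutoff, short, long_)
```

Because ties at the median are assigned to the short-interval group, the two groups need not be of equal size.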

Test scores were also available for 
750 twins (36, Ch. 3; 90). In this 
survey, family size averaged higher 
for twins than for the total sample, 
even when the twins were counted 
as a single birth. Analysis of paternal 
occupation showed a greater propor- 
tion of farmers in the twin sample 
than in the total sample. With regard
to all other occupational classes,
however, the twin sample was superior
to the total sample. Thus a
higher percentage of twins had fathers
in the managerial and professional
classes, and a lower percentage
in the clerical and manual labor categories
than was the case in the total
sample. On the whole, then, the findings
pertaining to occupation are
similar to those of the Scottish survey,
in which paternal occupation
was somewhat superior for twins
than for singletons.

The test scores of the twins were 
again inferior to those of the total 
sample. This difference was found 
at each age, and remained when 
family size and occupational level 
were controlled. In the case of fami- 
lies of four or more children, how- 
ever, the twin inferiority was negligi- 
ble and inconsistent, being most pro- 
nounced in the case of two- and three- 
sib families. Moreover, among the 
two-sib families, the twin scores were 
very similar to those of short-interval
sibs, as defined in the earlier part of
the study. 

Other similar surveys have been 
conducted on a less extensive scale 
in several countries. In England, 
there have been a number of at- 
tempts to test complete age samples 
of children within restricted areas. 
Fraser Roberts and his co-workers 
(77, 79) administered the Otis test 
to all children born during a four-year
period and living within the
city of Bath; a subsample was also 
individually tested with the Stan- 
ford-Binet. The Cattell Culture-Free 
Test was used by Cattell (18, 19) 
with all 10-year-old children in Lei- 
cester, a typical industrial city, and 
Devonshire, an “unspoilt rural area.”
Sutherland and Thomson (89) ap- 
plied a specially developed group in- 
telligence test to all 11-year-old 
school children in the Isle of Wight. 
Burt (14) reports the results obtained 
with the elementary school popula- 
tion of one of the London boroughs 
in the course of his standardization 
of the Binet scale. The Otis test 
scores of 393 10-year-olds in Sheffield 
were analyzed by Bradford (12). All 
of these studies confirm the findings 
of the more extensive surveys regard- 
ing the decline in mean intelligence 
test score with increasing size of sib- 
ship. When correlations were com- 
puted between test score and family 
size, the coefficients were uniformly 
negative and clustered in the .20’s 
and low .30’s. 
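
For readers who wish to see how such a coefficient is obtained, the sketch below computes a product-moment correlation between sibship size and test score. The six pairs of values are hypothetical and are not drawn from any of the studies cited; only the sign of the result is meant to mirror the findings described above.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / sqrt(sxx * syy)


# Hypothetical data: size of sibship and test score for six children.
sibship_sizes = [1, 2, 3, 4, 5, 6]
test_scores = [108, 112, 99, 104, 95, 97]
r = pearson_r(sibship_sizes, test_scores)
print(round(r, 2))
```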

A similar inverse relation between 
intelligence and family size is re- 
ported by Papavassiliou (72), who 
gave a Greek adaptation of the Stan- 
ford-Binet to 349 children in Athens. 
Heinen (37) found a tendency for 
family size to be inversely related to 
school grades and other indices of 
academic ability in several large 
samples of German school children. 
In another survey of German school 
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mean decrease in score per unit-in- 
crease in family size was .130 (83). 

The tendency for children from 
larger families to score lower per- 
sisted within occupational 
(83), a finding which has also been 
reported by other investigators (14, 
21). All such studies, however, have 
employed fairly broad occupational 
categories. Thus it cannot be as- 
sumed that socioeconomic differences 
have been ruled out when compari- 
sons are made within single occupa- 
tional levels. It is also interesting to 
note that, in the highest occupational 
levels, the negative correlation be- 
tween intelligence and family size 
may disappear or be replaced by a 
positive correlation. Cattell (21) and 
Moshinsky (68) report that the high- 
est negative correlations are found 
in occupations of intermediate so- 
cial status. In this connection, ref- 


classes 


erence may again be made to the 
factor of upward social mobility, dis- 
cussed in the preceding section. It 


is at the intermediate social levels 
that such a factor would be most 
likely to operate. 

In the 1947 Scottish survey, group 
test scores were available for 974 
twins (83). Analysis of these scores 
corroborated the usual finding that 
twins score lower than singletons, 
the difference in this group corre- 
sponding to approximately 5 IQ 
points. It was further demonstrated 
that such a difference could not be 
attributed to family size or to socio- 
economic level. There was no tend- 
ency for the twins to come from 
larger families, when number of 
births rather than number of chil- 
dren was considered. Such 
economic indices as relative size of 
home and paternal occupation indi- 
cated no inferiority of the twin sam- 
ple. In fact, there was an excess of 
twins in the professional and em- 
plover groups, and a deficit in the 


socio- 
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unskilled labor group. The implica- 
tions of these findings will be con- 
sidered below, after other pertinent 
data have been examined. 

Another extensive survey is that 
conducted among 6- to 12-year-old 
French school children in 1943-44 
(36, 38). This survey included 
95,237 boys and girls, approximately 
a 2 per cent sample of the elementary 
school population of France. Care 
was taken to obtain a representative 
distribution of cases over the twenty 
regions of France. All subjects were 
given René Gille’s “test Mosaique,”’ 
a specially constructed pictorial group 
test of intelligence.* When analvzed 
in respect to family size, the mean 
test exhibited a consistent 
drop with increasing size of sibship. 
This differential was apparent within 
each year group, the difference in 
mean between a one-child 
family and a family of eight or more 
being equivalent to 1 or 2 years of 
mental age. Within occupational 
groups, the negative relationship be- 
tween intelligence and family size 
was clearly apparent among farmers, 
manual laborers, and clerical work- 
ers; it was barely discernible in the 
managerial group and negligible in 
the professional class. 

A special study was concerned 
with the length of intersibling inter- 
val (36, Ch. 3; 90). Within the total 
sample covered by the French sur- 
vey, there were 1,244 two-sibling 
families in which both siblings had 
been tested. These were separated 
into “long interval” and “short in- 
ternal” sibships, the latter being 
defined as those falling at or below 


scores 


score 


3 This test is described and completely 
reproduced in the first report of the French 
survey (38, pp. 75-96). Data on reliability 
and validity are included in the second report 
(36, pp. 21-45). It was called the ‘‘mosaic”’ 
test because it includes a wide variety of 
items. 
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the median interval. In the urban 
group the median interval was 24 
months, in the rural 23 months. On 
the intelligence the children 
with long intersib intervals obtained 
significantly higher means, these 
differences persisting within each of 
the five occupational categories into 
which the sample was. subdivided. 
With long intersib intervals, the 
scores approximated those of only 
children; with short intersib inter- 
vals, they approximated the scores 
obtained by three-child — sibships. 

Test scores were also available for 
750 twins (36, Ch. 3; 90). In this 
survey, family size averaged higher 
for twins than for the total sample, 
even when the twins were counted 
asa single birth. Analysis of paternal 
occupation showed a greater propor- 
tion of farmers in the twin sample 
than in the total sample. With re- 


test, 


gard to all other occupational classes, 
however, the twin sample was supe- 
rior to the total sample. 


Thus a 
higher percentage of twins had fa- 
thers in the managerial and _ profes- 
sional classes, and a lower percentage 
in the clerical and manual labor cate- 
gories than was the case in the total 
sample. On the whole, then, the find- 
ings pertaining to occupation are 
similar to those of the Scottish sur- 
vey, in which paternal occupation 
was somewhat superior for 
than for singletons. 

The test scores of the twins were 
again inferior to those of the total 
sample. This difference was found 
at each age, and remained when 
family size and occupational level 
were controlled. In the case of fami- 
lies of four or more children, how- 
ever, the twin inferiority was negligi- 
ble and inconsistent, being most pro- 
nounced in the case of two- and three- 
sib families. Moreover, among the 
two-sib families, the twin scores were 
very similar to those of short-interval 


twins 
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sibs, as defined in the earlier part of 
the study. 

Other similar surveys have been 
conducted on a less extensive scale 
in several countries. In England, 
there have been a number of at- 
tempts to test complete age samples 
of children within restricted areas. 
Fraser Roberts and his co-workers 
(77, 79) administered the Otis test 
to all children born during a four- 
vear period and living within the 
city of Bath; a subsample was also 
individually tested with the Stan- 
ford-Binet. The Cattell Culture-Free 
Test was used by Cattell (18, 19) 
with all 10-year-old children in Lei- 
cester, a typical industrial city, and 
Devonshire, an ‘“‘unspoilt rural area.” 
Sutherland and Thomson (89) ap- 
plied a specially developed group in- 
telligence test to all 11-year-old 
school children in the Isle of Wight. 
Burt (14) reports the results obtained 
with the elementary school popula- 
tion of one of the London boroughs 
in the course of his standardization 
of the Binet scale. The Otis test 
scores of 393 10-year-olds in Sheffield
were analyzed by Bradford (12). All 
of these studies confirm the findings 
of the more extensive surveys regard- 
ing the decline in mean intelligence 
test score with increasing size of sib- 
ship. When correlations were com- 
puted between test score and family 
size, the coefficients were uniformly 
negative and clustered in the .20’s 
and low .30’s. 

A similar inverse relation between 
intelligence and family size is re-
ported by Papavassiliou (72), who
gave a Greek adaptation of the Stan- 
ford-Binet to 349 children in Athens. 
Heinen (37) found a tendency for 
family size to be inversely related to 
school grades and other indices of 
academic ability in several large 
samples of German school children. 
In another survey of German school
children, Eydt (34) reported more
retardation and behavior problems 
among children from large families. 

Several American studies have 
yielded similar results. Lentz (54) 
obtained a correlation of −.304 be-
tween family size and intelligence
test scores among 4,330 urban school 
children in five states. Within sepa- 
rate cities, however, the correlations 
varied from —.065 to —.411. This 
finding is not surprising in view of 
the fact that both the tests and the 
examiners varied from school to 
school. Chapman and Wiggins (24) 
reported a correlation of —.33 be- 
tween National Intelligence Test 
scores and size of family among 650 
school children. When the correla- 
tion was recomputed for children 
with native-born and with foreign-
born parents, it fell to −.22 and
−.24, respectively. The authors
point out that most of the children
came from completed families, since
the youngest child in any sibship in-
cluded in the study was 5 years old.

In a group of 1,140 8th grade 
school children in Oakland, Cali- 
fornia, Burks and Jones (13) found a 
correlation of —.22 between Terman 
Group Test and total number of sib- 
lings (including deceased siblings). 
This correlation rose to —.31 when 
corrected for restricted range in both 
IQ and sibship size. In order to in-
clude only completed families, the
analysis was limited to sibships in 
which the youngest child was at 
least 9 years old. A correlation of 
—.31 between family size and Ter- 
man-McNemar intelligence test 
scores was obtained by Damrin (26) 
in a group of 156 high school girls. 

Hirsch (40) found a consistent de- 
crease in mean Otis IQ with increas- 
ing family size in a group of 214 Ten- 
nessee school children. The mean 
dropped from 115.8 for one-child 
families to 96.6 for eight-child fami-
lies. Among still larger families, the
mean varied irregularly, dropping to 
82.0 for twelve-child families. In a 
study of more than 20,000 retarded 
children in the Massachusetts public
schools, Dayton (27) noted a de-
crease in mean IQ from 73.2 for one-
child families to 68.9 for families of 
ten or more children. 

A more thorough analysis of the 
problem is provided in a study by 
Conrad and Jones (25), who tested 
representative samples of the rural 
population of three New England 
states. The children were examined 
with the Stanford-Binet, and the 
adults with the Army Alpha. Family 
size was investigated in relation to 
intelligence test score of mother and 
father, education of mother and fa- 
ther, a social status index of the fam- 
ily, and mean intelligence test score 
of the sibship. In uncompleted fam- 
ilies, all of these measures correlated 
negatively with family size, although 
only the correlations with father’s 
intelligence and with social status 
reached statistical significance. In 
the completed families, on the other 
hand, all correlations were positive, 
but none was statistically significant. 
Further analyses showed that the 
families of lower intelligence began 
to have children at an earlier age, 
although their total child-bearing 
period was no longer than in the more 
intelligent families. These conclu- 
sions, of course, must be restricted to 
the particular population under in- 
vestigation. 

Similar results were obtained, how- 
ever, in a study of California fami- 
lies reported by Willoughby (108). 
On the basis of parental ages, the 
author concluded that most of the 
families were completed. Parental 
intelligence was determined from the 
combined score on six verbal and 
five nonverbal tests. Number of 
children correlated −.11 with the
test scores of 108 mothers and +.06
with the scores of 87 fathers, neither 
correlation being statistically signifi- 
cant. 

Another study in which intelli-
gence test scores of parents were em-
ployed is that of Willoughby and
Coogan (109). Follow-up data were
gathered on 373 persons who had
graduated from high school 12 years
earlier and who had taken an intelli-
gence test while still in school. No
relationship was found between the
original test scores and age at mar-
riage, age at birth of first child, or
number of offspring at the time of 
the survey. It is also interesting to 
note that within the same group a 
low but significant negative correla- 
tion was found between intelligence 
test score and the number of siblings 
of the subjects themselves. More- 
over, the subsequent education and 
the occupational level of the subjects 
revealed an inverse relation to num- 
ber of offspring. Although this study 
is inconclusive because of the small 
and selected sample and because of 
the incompleteness of the families, it 
nevertheless represents a promising 
approach to the problem.4 Its chief
merits lie in the fact that intelligence 
of parents was investigated directly 
and in the fact that such intelligence 
was measured prior to educational 
and vocational differentiation. 

An investigation currently in prog- 
ress provides further information on 
the relationship between intelligence 
of parents and number of offspring.5 
The original data for this study were 
gathered by E. Lowell Kelly as part 
of a broader project involving the 
testing, interviewing, and subse-
quent 20-year follow-up of 300 en-
gaged couples living in New England
(46). The data pertaining to fertility,
based on 216 couples, are being proc-
essed by Charles F. Westoff and
Elliot G. Mishler of the Office of Pop-
ulation Research, Princeton Uni-
versity. At the time of the initial
testing, the mean age of the men was
26.6 years and that of the women
24.6, most of the subjects falling
within the age limits of 21 and 30.
Since data on number of offspring
were obtained 20 years later, it can
be assumed that most of the families
were completed. Educationally, the
group represented a superior sample,
more than half of the men and ap-
proximately a third of the women
being college graduates at the time
of initial testing. The correlation be-
tween initial Otis Intelligence Test
scores and subsequent number of
live births was .19 for the 216 men
and .17 for the 216 women, both be-
ing significant at approximately the
.01 level. These correlations thus
suggest a slight tendency for the
brighter parents to have larger fami-
lies. It is noteworthy that none of
the studies in which parental intelli-
gence was correlated with number of
offspring has so far yielded the nega-
tive correlation customarily found
when child intelligence is correlated
with size of sibship.

4 Cf. critique of this study by Fraser Rob-
erts (78), but note that one of the criticisms
is based upon a confusion between number of
siblings and number of offspring.

5 Personal communication from Dr. Charles
F. Westoff, November 4, 1955. Cf. also (67).

The relationship between fertility 
and intelligence has also been investi- 
gated within certain special groups. 
Moshinsky (68) analyzed the test 
scores of 10,159 English children en- 
rolled in different types of schools. 
In the relatively heterogeneous ele- 
mentary school group, a correlation 
of —.23 was found between intelli- 
gence and family size. Within more 
advanced, fee-paying schools, the 
correlations were negligible. Both 
Moshinsky and Blackburn (8) cite 
the latter finding in support of cer-
tain hypotheses regarding the rela-
tionship between intelligence test
performance and family size. As 
Burt (15) points out, however, such 
a result is to be expected because of 
the greater intellectual homogeneity 
of the children in the more advanced 
schools. 

Hutton (42) reports an insignifi- 
cant correlation of —.05 between IQ 
and number of siblings in the case of 
boys attending a British secondary 
grammar school. The group is de- 
scribed as falling within the upper 
half of the general population in in- 
telligence. A questionnaire survey of 
alumni of the same school indicated 
that the highly selected scholarship 
holders as a group tended to have 
a smaller number of offspring than 
the general population of Great 
Britain. But among the more recent
graduates the reverse was true, the
scholars showing a higher replace-
ment rate than the generality.

A number of studies on college
students, in both Great Britain and
the United States, have yielded very 
low and usually insignificant correla- 
tions between intelligence test scores 
and number of siblings (39, 64, 65, 85, 
102, 103). This lack of correlation is 
understandable in the light of the 
highly selected nature of such sam- 
ples. Not only are college students 
selected intellectually; they are also 
selected with respect to the constella- 
tion of psychological and socioeco- 
nomic factors which determine who 
goes to college. Some of these factors 
may themselves be directly related 
to family size. 

In their study of California public 
school children with Stanford-Binet 
IQ’s of 140 or higher, Terman and
his co-workers found a correlation of 
—.271 between IQ and size of sibship 
within 91 completed families (91). 
Through the subsequent follow-ups 
of the entire gifted group, additional
data have been gathered on the com-
pleted family size of nearly all of
the original cases. These data, how- 
ever, have not yet been analyzed. 
The forthcoming fifth volume of 
Genetic Studies of Genius will contain 
extensive data on the relation be- 
tween intelligence and number of 
offspring of the gifted subjects them-
selves.6

In connection with a slum clear- 
ance survey, Dawson (28) adminis- 
tered the Stanford-Binet to 1,239 
Glasgow children, aged 3 to 14. Most 
of the children were tested just before 
their families moved to a new hous- 
ing development. The correlation 
between these IQ’s and the number
of living sibs was −.12; within a sub-
group of 140 completed families, the
correlation was −.10. The corre-
sponding correlations with total num-
ber of births (including stillbirths)
were −.20 and −.30, respectively.
O'Hanlon (71) reports the results of
retesting 293 of these children eight
years later. This group yielded a
correlation of −.137 between IQ and
number of living sibs, the correlation 
rising to —.291 in a subgroup of 28 
completed families. The correspond- 
ing correlations with total number of 
births were —.207 and —.413. The 
results of both Dawson and O’Han- 
lon illustrate the fact that differential 
fertility is partly offset by a differ- 
ential death rate during childhood. 

In an early study, Pearson and 
Moul (73) found no significant cor- 
relation between size of sibship and 
teachers’ ratings of intelligence 
among 1,202 Jewish school children 
in London. The children were largely 
of foreign-born parentage and came 
from relatively homogeneous and 
low socioeconomic levels. Fertility 
was uniformly high in the group as
a whole. Sutherland (87) reports
correlations of −.129 and −.126
between IQ and family size in two
samples of children of British coal
miners totaling 3,096 cases. These
correlations are consistent with the
findings of other investigators inso-
far as the relationship is somewhat
lower in the poorest socioeconomic
levels than in the intermediate levels.

6 Personal communication from Dr. Lewis
M. Terman, April 28, 1955.

In a study of 354 children in a 
Jewish orphanage in New York City, 
Locke and Goldstein (56) found a 
correlation of —.13 between Stan- 
ford-Binet IQ and size of sibship. 
When both mother’s age and order 
of birth were partialled out, this cor- 
relation rose to —.24. It is rather 
difficult, however, to interpret this 
partial correlation. For compara- 
bility with correlations obtained in 
other studies, moreover, the original 
correlation of —.13 would seem pref- 
erable. 

Sutherland (88) analyzed the IQ's
of two samples of fatherless children.
The first consisted of 123 12- to 14-
year-old Glasgow school children
whose fathers had died when the
children were under one year of age.
The second sample comprised 724
Yorkshire school children aged 11 to
13 who were also fatherless but not
necessarily from infancy. Each sam-
ple was compared with a control
group of children, both of whose par-
ents were living. The mean IQ of the
fatherless children was lower than
that of control children with the same
number of siblings. The correlations
between IQ and family size were sig-
nificantly negative, but lower among
fatherless children than among the
controls. The respective correlations
were −.188 and −.26 in the Edin-
burgh samples, and −.19 and −.23
in the Yorkshire samples.

In discussing these results, Suther-
land writes, “The tendency for the
brighter children to come from the
smaller families has been reduced by
the presence of small families whose 
natural increase has been prevented 
by death and whose children would 
normally have had their place among 
the larger families” (88, p. 168). A
similar explanation may account for 
the lower IQ's of the fatherless chil- 
dren within each sibship size. The 
test performance of such children 
resembled that of children in larger 
families, to which they would have 
belonged if family growth had not 
been cut off by the father’s death. 
This type of study represents one 
step toward the analysis of the com- 
plex factors determining the correla- 
tion between intelligence and family 
size. Its findings suggest that it is 
not size of sibship per se but other 
factors associated with family size 
within a given culture which produce 
the obtained differentials in intel- 
lectual level. 


FOLLOW-UP STUDIES WITH
INTELLIGENCE TESTS

On the basis of the negative cor- 
relation between intelligence test 
scores and family size, several writers 
predicted a gradual decline in the in- 
tellectual level of the population (14, 
18, 19, 20, 54, 77, 79, 92, 100). The 
estimated drop varied from about 2
to 4 IQ points per generation. A
direct test of such a predicted decline 
is provided by follow-up surveys in 
which comparable samples have been 
tested under similar conditions after 
a lapse of several years. The most 
extensive and best controlled data on 
this question are to be found in the 
previously described Scottish sur-
veys (81, 82, 83, 93, 94). Rather
than showing the predicted decline, 
however, the scores revealed a small 
but significant improvement from 
the 1932 to the 1947 surveys. 

Other similar follow-ups have been 
reported by Burt (14), Cattell (23),
and Emmett (33). In comparison
to the Scottish surveys, these studies
covered somewhat shorter intervals
and used smaller samples of ques-
tionable comparability. When test-
ing is conducted within a limited area,
such as a single county, city, or bor-
ough, successive samples may differ
owing to selective migration, dete-
rioration of neighborhoods, slum clear-
ance, changes in proportion of chil-
dren institutionalized or attending
private schools, and other similar
factors. Moreover, differences in
thoroughness of sampling procedures 
in successive surveys may introduce 
differential selection. For example, 
subjects in lower socioeconomic levels 
and those who are physically less fit 
are more likely to be excluded when 
sampling coverage is less adequate. 
Despite certain local variations in 
results, however, these studies also 
failed to substantiate the expected 
decline in test score. 

Reference may likewise be made 
to certain relevant data gathered in- 
cidentally, as by-products from other 
types of testing programs. Thus an 
analysis of the intelligence-test per- 
formance of American soldiers in 
World Wars I and II indicated that 
the level of performance had im- 
proved to such an extent that the 
median score of the later sample cor- 
responded to the 83rd percentile of 
the earlier sample (95). This rise in 
score paralleled an increase in amount 
of education, the mean being at the 
8th grade for the first sample and at 
the 10th grade for the second. It is 
also noteworthy that a survey of the 
intelligence test performance of 
American high school students over 
a 20-year period suggested that this, 
too, had improved, despite the 
marked increase in the proportion of 
students enrolled in high school (35). 
Since a larger proportion of the total 
population was attending high school 
at the end than at the beginning of
this 20-year period, a decrease in
mean score would be expected, unless 
the total population had improved 
sufficiently to counteract such a drop. 

When educational and other en- 
vironmental conditions within a given 
community improve conspicuously, 
larger rises in test scores may be ob- 
served over even shorter intervals. 
Thus a 10-year follow-up conducted 
in a relatively isolated mountainous 
region of eastern Tennessee showed 
a 10-point rise in median IQ (105).
During this interval, the socioeco- 
nomic and educational conditions of 
this region had greatly improved. 
The second sample tested in this sur- 
vey was closely comparable to the 
first, the children coming largely from 
the same families in the two cases. 

Similar results are to be found in 
the previously cited slum survey con- 
ducted by Dawson (29). A subgroup 
of 289 4- to 9-year-old children who
had been tested with the Stanford- 
Binet at the time of their transfer 
from the slum area was retested after 
12 to 18 months of residence in im- 
proved housing. A control group of 
56 cases who had not been transferred 
from the slum area was also retested 
over the same interval. The slum 
clearance group showed a significant 
mean rise of 1.5 IQ points, while the
control group showed no significant 
change. To be sure, the mean IQ 
rise in the clearance group, though 
significant, was small. But it should 
be borne in mind that the retest was 
conducted over a relatively short 
period of time. Moreover, it is note- 
worthy that the improvement of the 
slum clearance group was also mani- 
fested on tests of arithmetic and 
reading, and persisted when com- 
parisons were made separately for 
boys and girls in all three variables. 
In the control group, on the other 
hand, about half of these compari- 
sons yielded positive changes and 
half negative changes.

That intelligence test performance
is susceptible to improvement as a 
result of education has been repeat- 
edly demonstrated by a variety of 
methods (4, Ch. 8). Of special inter- 
est are two investigations in which 
persons who had been tested follow- 
ing a uniform period of universal 
education were retested after 10
years in one case and after 20 years
in the other (41, 58). In the interven-
ing years, members of these groups
had received varying amounts of 
further education. Both studies re- 
port significant relationships between 
amount of subsequent schooling and 
retest performance, when initial scores 
are held constant. 

One of the hypotheses proposed to 
account for the rise in mean score
found in the Scottish surveys and
similar follow-ups is that of test
sophistication. Thus it is argued
that the subjects tested in the later
samples had the advantage of greater
familiarity with psychological tests.
With regard to the Scottish surveys
themselves, a special analysis of the
scores obtained by children who had
and those who had not been previ-
ously examined with a similar test
showed little effect of such test ex-
perience (83, pp. 121-124). In a
later and more extensive study of the
same question, two subgroups within
the 1947 Scottish sample were com-
pared. One was taken from Educa-
tion Authorities in which only 2.5%
of the children were reported to have
had previous test experience within
the year of the survey; the other in-
cluded Educational Authorities in
which 67% of the children had had
such experience. The mean rise in
score from 1932 to 1947 in the two
subgroups was 1.5 and 3.4 points,
respectively, both being significant
at the .01 level.7

7 Personal communication from Mr. James
Maxwell, University of Edinburgh, May 17,
1955.

Fundamentally, the question of
test sophistication and the related 
question of coaching must be an- 
swered in terms of breadth of im- 
provement (cf. 1; 2, pp. 52-56). Any 
influence which is restricted to the 
test performance itself and does not 
correspondingly affect the criterion 
behavior which the test is designed 
to predict would of course reduce the 
validity of the test. But the broad 
social and educational changes which 
have occurred in the time intervals 
under consideration can be expected 
to affect the individual’s over-all 
intellectual development, rather than 
being limited to the particular be- 
havior samples covered by specific 
tests. 

Carrying the argument a bit fur-
ther, some writers have argued that
“true, innate” intelligence may still
have declined, despite the rise in in- 
telligence test performance brought 
about by improved environmental 
conditions. Some have gone so far as 
to suggest that if test scores show 
improvement under such conditions, 
then the tests are at fault. This is 
analogous to arguing that the “true, 
innate” height of the population has 
declined as a result of the negative 
correlation between height and fam- 
ily size, and that the observed rise in 
mean height is illusory. Moreover, 
it could then be argued that we 
should devise a “culture-free” meter-
stick to measure innate height freed
from environmental influences. Such 
reasoning does not make scientific 
sense. When the psychologist speaks 
of intelligence, he refers to certain 
properties of observable behavior. 
Such behavior is by its very nature 
susceptible to environmental influ- 
ences. The same influences which 
bring about a rise in test score also 
cause an improvement in the quality 
of the individual’s general intellectual 
functioning (cf. 31, 75, 100). 

Penrose (74, 75, 76) has argued
that, even from a genetic point of
view, the negative correlation be- 
tween intelligence test scores and size 
of sibship need not imply a decline in 
the intellectual level of the popula- 
tion. Taking into account assortative 
mating and the infertility of low- 
grade mental defectives, he presents 
a simplified genetic model by which 
intellectual level remains constant 
despite the existing fertility differen- 
tials. Essentially, such a situation 
results from the fact that some par- 
ents of average or borderline intelli- 
gence do have above-average chil- 
dren. In a much earlier article by 
Willoughby and Goodrie (110), a 
similar argument is presented. Be- 
ginning with a correlation of —.17 
between intelligence and family size, 
the authors set up a genetic model 
based upon the combined contribu- 
tion of five gene pairs. They then 
trace the effects mathematically 
through five generations, using two 
marital coefficients, .37 and .58. With 
the former coefficient, a decline in 
mean intelligence results. With the 
latter, which is much closer to em-
pirically obtained husband-wife cor-
relations in intelligence test scores,
there is actually a slight rise in mean
intelligence.


METHODOLOGICAL PROBLEMS 


Investigations on the relationship 
between family size and intelligence 
present a number of special meth- 
odological problems with regard to 
sampling and statistical analysis. A 
source of difficulty in several studies 
is the inclusion of incomplete families 
in the sample. This factor may oper- 
ate in a number of different and not 
entirely predictable ways. All chil- 
dren from such incomplete families 
are, of course, classified as coming 
from somewhat smaller families than 
will eventually be the case. This may 
have the effect of classifying lower
test-scorers (from the ultimately
larger families) into a category of
higher test-scorers (from smaller fam-
ilies). Such a procedure would thus
reduce the obtained differences in
test scores between the various fam-
ily-size categories. On the other
hand, when a negative correlation is
found between intelligence and fam-
ily size in a sample of incomplete
families, it may simply reflect the
tendency for persons of lower intelli-
gence to begin having children ear-
lier. Should such persons also stop
having children earlier than the more
intelligent members of the commu-
nity, the correlation would disappear
when completed families are studied.
In the previously reported investiga- 
tion by Conrad and Jones (25) in 
rural New England communities, 
such was actually the case. 

It should be noted, of course, that 
if the less intelligent parents begin 
having children earlier than the more 
intelligent parents, they will con-
tribute more individuals to the popu-
lation in the long run, through more
frequent generations. For example, 
if each successive generation begins 
to bear children at age 20, there will 
be five generations per century; while 
if child-bearing begins at age 25, 
there will be only four generations.
Nevertheless, the effect upon the 
intellectual level of the population 
would be less if the duller parents 
merely began to have children earlier 
than it would be if the duller also had 
larger completed families. 
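
The arithmetic of generation length can be sketched briefly. The figures below are illustrative assumptions only: both groups are given the same hypothetical number of surviving children per parent (`rate`), so that any difference in descendants comes solely from the age at which child-bearing begins.

```python
def generations_per_century(age_at_childbearing: int) -> int:
    """Number of complete generations that fit into 100 years."""
    return 100 // age_at_childbearing


def relative_descendants(children_per_parent: float, age_at_childbearing: int) -> float:
    """Descendants per original parent after a century, assuming every
    generation reproduces at the same age and at the same rate."""
    return children_per_parent ** generations_per_century(age_at_childbearing)


early = generations_per_century(20)  # child-bearing begins at age 20
late = generations_per_century(25)   # child-bearing begins at age 25

# Hypothetical rate: 1.5 surviving children per parent in both groups.
rate = 1.5
ratio = relative_descendants(rate, 20) / relative_descendants(rate, 25)

print(early, late, round(ratio, 2))  # 5 4 1.5
```

With equal completed family size, the earlier-starting group gains exactly one extra generation of growth per century, which is a smaller effect than would follow if that group also had larger completed families.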

Age of parents is a factor which
needs to be considered in its own
right. The smaller families in a sam-
ple may include a relatively large
proportion of incomplete families of
younger parents. Such parents are
likely to be better educated than the
older parents of the completed fami-
lies, owing to the progressive rise in
educational level of the population.
The better educated parents would
in turn provide a more favorable en-
vironment for their offspring. Thus
when incomplete families are in- 
cluded in the sample, the differences 
in parental age, coupled with the ris- 
ing level of general education, may 
account at least in part for the nega- 
tive correlation found between fam- 
ily size and intelligence. 

A second major problem pertains 
to selective factors. Since the correla- 
tions in question are generally quite 
low, the operation of selective fac-
tors, however slight, may produce a
completely spurious result. An ex- 
ample of such a subtle selective factor 
is given in Shuey’s study of students 
in an American women’s college (85). 
Within the entire group of 2,261 en- 
tering students for whom test scores 
were available, a significant inverse
relation was found between test
scores and number of siblings. This
relationship disappeared, however,
when the records of those whose
older sisters had attended the same
college were excluded. Presumably
somewhat lower entrance require-
ments had been applied to those
whose siblings had previously ma- 
triculated. Such students would also 
be likely to come from larger families. 

Nor are more general samples free
from selective factors. Thus in a
sample of children between, let us
say, the ages of 8 and 14, individual
large families will appear repeatedly
within the sample, the number of re-
appearances being directly propor-
tional to the size of the family. A
family containing five school-age
children, for example, would appear
five times in the sample. Any chance 
conditions characterizing individual 
large families would thus be spuri- 
ously magnified in the sample. Nor 
would ordinary sampling statistics 
be strictly applicable to such samples. 
It is partly to avoid such difficulties 
that single-age samples, such as 11- 
year-olds, have been utilized. 
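
The repeated appearance of large families in a multi-age child sample can be illustrated with a minimal sketch. The five families and their sizes are hypothetical, and every child is assumed to fall within the sampled age range.

```python
from collections import Counter

# Hypothetical families: label -> number of children in the sampled age range.
families = {"A": 1, "B": 2, "C": 3, "D": 4, "E": 5}

# A child sample lists each child once, so a family is listed once per child.
child_sample = [label for label, n in families.items() for _ in range(n)]
appearances = Counter(child_sample)

# Mean family size computed over families versus over sampled children.
mean_per_family = sum(families.values()) / len(families)
mean_per_child = sum(families[label] for label in child_sample) / len(child_sample)

print(appearances["E"])          # 5
print(mean_per_family)           # 3.0
print(round(mean_per_child, 2))  # 3.67
```

Any characteristic peculiar to family E enters the child sample five times, and the sampled observations are not independent of one another, which is why ordinary sampling statistics do not strictly apply.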

Even carefully chosen single-age
samples, however, are not entirely
free from selective bias. In such a 
sample, for example, large families 
will be overrepresented, since by 
chance a child of any specified age is 
more likely to be found in a large 
family than in a small one. Hence 
the sample is not representative of 
existing families, nor of the parental 
population. With regard to the child 
population, it can be argued that 
large families should be overrepre- 
sented, since they contribute more 
members to the population. Never- 
theless, certain types of comparisons 
may be distorted through the use of
a single-age sample. The reports of
the Scottish survey contain a discus- 
sion of the possible effects of such 
selective biases upon the analysis of 
birth order (82) and maternal age 
(83) in relation to child’s intelligence. 

Parenthetically, it may be added 
that studies on the relation of birth 
order to intellectual and other psy-
chological characteristics have fre-
quently yielded ambiguous and in-
consistent results because of the
failure to take family size into ac- 
count. Let us suppose that one group 
consists entirely of two-child families, 
while a second group contains only 
four-child families. Obviously, in the 
first group 50 per cent of the children 
will be first-born, while in the second 
there will be only 25 per cent. In the 
Scottish survey (82), the child’s posi- 
tion in his sibship was recorded as 
1/4, 2/5, etc., signifying the first-born in
a sibship of four, the second-born in 
a sibship of five, and so forth. Com- 
parisons were then made within each 
family size. Under these conditions, 
the first-born and last-born scored 
higher than the intermediate sibs. 
But even these results are inconclu- 
sive because of the previously men- 
tioned sampling biases. 
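
The confounding of birth order with family size can be shown with a short sketch. The sample composition (100 two-child and 100 four-child families) is a hypothetical assumption chosen to match the two-group illustration given above.

```python
from fractions import Fraction

# Hypothetical sample: family size -> number of families of that size.
families_by_size = {2: 100, 4: 100}


def first_born_share(size: int) -> Fraction:
    """Fraction of the children in a family of this size who are first-born."""
    return Fraction(1, size)


# Each family contributes one first-born and (size - 1) later-born children.
first_borns = {size: n for size, n in families_by_size.items()}
later_borns = {size: n * (size - 1) for size, n in families_by_size.items()}

# Share of each birth-order group that comes from two-child families.
small_among_first = Fraction(first_borns[2], sum(first_borns.values()))
small_among_later = Fraction(later_borns[2], sum(later_borns.values()))

print(first_born_share(2), first_born_share(4))  # 1/2 1/4
print(small_among_first, small_among_later)      # 1/2 1/4
```

In a pooled comparison, half of the first-born but only a quarter of the later-born come from the small families, so any score difference associated with family size would masquerade as a birth-order effect unless comparisons are made within each family size.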

The computation of predicted in-
tellectual decline involves question-
able assumptions. The usual pro-
cedure is based essentially upon the
comparison of two means obtained
from the same set of scores (cf.,
e.g., 14, 19). The mean of the sub-
jects’ scores can of course be com-
puted directly. The mean of the
next generation is estimated by mul-
tiplying the frequency in each class-
interval of the IQ distribution by
mean family size in that interval and
using these new frequencies to find
the mean. Similarly, the mean score
of the parents of the present subjects
is estimated by dividing the original
frequencies by family size and mul-
tiplying by two.
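
The reweighting procedure just described may be sketched as follows. The class-interval midpoints, frequencies, and mean family sizes are invented for illustration and are not taken from any of the cited studies; the sketch reproduces only the arithmetic of the procedure, not its justification.

```python
def weighted_mean(values, weights):
    """Mean of `values`, each weighted by the matching entry of `weights`."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


# Hypothetical IQ distribution of the tested children, by class interval.
iq_midpoints = [80, 100, 120]
child_counts = [30, 50, 20]          # frequency in each interval
mean_family_size = [4.0, 3.0, 2.0]   # mean sibship size in each interval

# Mean of the subjects themselves, computed directly.
subjects_mean = weighted_mean(iq_midpoints, child_counts)

# Next generation: each frequency multiplied by mean family size.
next_gen_weights = [f * s for f, s in zip(child_counts, mean_family_size)]
next_gen_mean = weighted_mean(iq_midpoints, next_gen_weights)

# Parents: each frequency divided by family size and multiplied by two.
parent_weights = [2 * f / s for f, s in zip(child_counts, mean_family_size)]
parents_mean = weighted_mean(iq_midpoints, parent_weights)

print(round(parents_mean, 2), subjects_mean, round(next_gen_mean, 2))
# 101.46 98.0 94.84
```

Because family size falls as score rises in these made-up figures, the estimated means decline from one generation to the next. The factor of two changes the estimated number of parents but cancels out of their mean.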

Such a procedure is based upon the assumption of a perfect parent-child correlation in both intelligence test score and family size. In neither case is the assumption consistent with available information. Parent-child correlations in intelligence test scores are generally in the .50's (cf. 4, pp. 318-320). With regard to fertility, parent-child discrepancies may arise from various sources, including the fertility differentials among socioeconomic classes and the social mobility of individuals. Available data indicate that when persons move from one culture or subculture to another, their fertility tends to be intermediate between that of the group from which they have come and that of the group which they join.

Another noteworthy methodological point pertains to the definition of mean family size. When size of sibship is recorded for each child and these sizes are averaged, the mean family size per child is obtained. On the other hand, when number of children in each family is recorded, as in a census survey, the average of these numbers gives mean family size per family. Thus if we have a sample of 5 families consisting of 1, 2, 3, 4, and 5 children, respectively, the median family has 3 children. But the median child, out of the total 15 in the sample, belongs to a family of 4. The corresponding means are 3 for families and 3.67 for children. The latter is the contraharmonic mean discussed by Jaspen (44, 45) and bears a direct arithmetic relation to the former. If X represents number of children in each family and N represents number of families, the mean per family is ΣX/N and the mean per child is ΣX²/ΣX. In making comparisons between mean family sizes of populations or segments of a population, it is obviously essential to insure that the same type of mean is employed. This point is especially likely to arise when results obtained with a child sample are compared to national census figures, since the former are usually expressed as mean per child, the latter as mean per family.
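
The five-family illustration can be verified with a brief sketch using only the Python standard library. The variable names are assumptions introduced for illustration.

```python
from statistics import mean, median

# One entry per family: number of children (the five-family example).
families = [1, 2, 3, 4, 5]

# Per-family view: each family counted once, as in a census.
mean_per_family = mean(families)
median_per_family = median(families)

# Per-child view: each child reports the size of his own sibship,
# so a family of size X appears X times.
sibship_sizes = [size for size in families for _ in range(size)]
mean_per_child = mean(sibship_sizes)
median_per_child = median(sibship_sizes)

# Contraharmonic mean: sum of X squared divided by sum of X.
contraharmonic = sum(x * x for x in families) / sum(families)

print(mean_per_family, median_per_family)
print(round(mean_per_child, 2), median_per_child)
print(round(contraharmonic, 2))
```

The per-child mean and the contraharmonic mean agree because a family of size X contributes X children, each reporting X, so the sum over children is the sum of X squared taken over families.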

Attention should likewise be given to certain problems dealing with the choice and application of psychological tests in studies on fertility and intelligence. An intrinsic feature of the design of the Scottish surveys involved the transmutation of scores from a group test, administered to the entire sample, to Stanford-Binet IQ's obtained from a cross section of the larger sample. It was apparently felt that the scores of the entire sample would gain in meaningfulness if put in terms of such IQ's. Apart from the wider familiarity of the IQ, however, it is difficult to see what such a conversion accomplishes. Moreover, a number of specific objections can be raised against this procedure. Since different intelligence tests measure a somewhat different combination of functions, individuals cannot be expected to retain the same relative standing on such tests. Intelligence tests vary not only in score scale and in normative population, but also in content. The reliability and validity of the converted scores depend upon the properties of the test which was actually administered, rather than upon the properties of the test into which the scores were converted. For example, if an individual test is more precise because of its better control of rapport and motivation, longer testing time, or any other favorable conditions, these characteristics are not transferred to the group test by transmutation of scores. Nor is the use of the traditional ratio IQ advisable for such purposes. Standard scores, expressed as deviation IQ's or in some other convenient form, are far more to be recommended.

[...] the exclusive use of g[...] intelligence in such studies [...] are more highly [...] family size than spatial [...] aptitudes, and whether [...] mean score found in follow-up surveys is uniformly high in all aspects of intelligence.


ANALYSIS OF CAUSAL RELATIONS 
Even with similar results, different investigators have offered varied interpretations of the correlation between intelligence and family size. It is especially important in this connection to differentiate between plausible hypotheses and empirically established facts, and to ferret out tacit assumptions which may underlie certain interpretations. For example, if intelligence is defined in terms of hereditary predispositions, it becomes logically impossible to investigate whether or not certain individual differences in intelligence may be related to gene constitution. It is more consistent with scientific method to employ definitions which do not prejudge the data, but which permit the maximum freedom of interpretation.

We may, of course, define the word “intelligence” to mean anything we choose. All studies on the relation of intelligence to family size, however, have employed psychological test scores as their index of intelligence. It is therefore evident that interpretations must take such test scores as their point of departure. When intelligence is defined in such terms, we may recognize three distinct etiological mechanisms whereby the obtained correlations between intelligence and family size might result (cf. 3, 59). It is understood, of course, that the actual causal relations may involve any combination of two or all three of these mechanisms.


First, there may be inherited structural factors (neural, glandular, etc.) which serve as constraints, reducing adaptability of behavior and limiting the sort of intellectual development measured by current intelligence tests. The less able parents would thus transmit their hereditary limitations to their offspring. The obtained correlations would then result from the fact that, within a given culture or subculture, persons with inferior heredity tended to have more offspring.

A second explanation attributes individual differences in children's abilities to psychological differences in the environments provided by parents of varying intellectual levels. In this case, the correlation between family size and intelligence of offspring would again result from a tendency for the less intelligent parents to have more children, but heredity would not be involved. Differences in intellectual level among the parents, as well as among the offspring, could thus have resulted from environmental factors.

In this connection, mention should also be made of the point of view represented by Vernon (101). In a discussion of the use of intelligence test scores in population studies, he urges the need for caution in interpreting such scores because of the effect of differences in cultural patterns upon test performance. Such cultural differences, however, may affect not only test scores but also the individual's over-all intellectual development. The real question pertains not to inadequacies of available measuring instruments but rather to the interpretation of the existing behavioral differences.

It may be added that the individual's intellectual development may be influenced, not only by the nature and extent of direct intellectual stimulation offered by his home, but also by emotional and motivational factors deriving from the “social climate” of the home. There is a growing body of data pertaining to differences in child-rearing practices in various socioeconomic levels and other subcultures (4, Ch. 23 and p. 734). Some of these differences, such as the degree to which verbalization and exploratory behavior are encouraged or discouraged, are likely to influence the child's intellectual development.

A third possible interpretation of the obtained correlations between intelligence and fertility is based upon size of family itself as a causal factor. For example, a larger family, at least in certain socioeconomic levels, would reduce the per capita funds available for education, recreation, suitable housing, proper food, medical attention, and other environmental requisites. From a psychological viewpoint, another important factor is the degree of adult contact provided in families of different sizes. Available evidence suggests, for example, that such contact may be the most important single factor in linguistic development (4, pp. 335-339; 62). And it is well known that verbal ability plays a major role both in educational progress and in intelligence test performance. Whatever the specific causal factors involved, the mechanism under consideration is independent of the intellectual level of parents. A crucial test of this hypothesis would thus be found in the correlation between size of sibship and intelligence of offspring, when intellectual level of parents is held constant.


One additional point may be considered. Insofar as family size itself may be a factor in intellectual development, the general decrease in family size in the period covered by most follow-up surveys is noteworthy. Quite apart from fertility differentials, such a decrease may have operated as one factor in the observed rise in intellectual level.

It is evident that the three hypotheses differ significantly in both their theoretical and practical implications. On the basis of existing data, it is impossible to choose among them or to determine the relative contributions of each of the three types of influences. The complex interaction of many variables makes analysis of causal relations difficult in this area. A few of the previously cited results, however, have a bearing upon causal interpretations. Thus Sutherland's study of fatherless children suggested that family size per se is insufficient to account for the obtained correlations between family size and intelligence (88). On the other hand, the findings on twins in both the Scottish (83) and French (36, 90) surveys are consistent with interpretations in terms of adult contact. The results of Tabah and Sutter (36, 90) on intersib interval likewise seem to support such an explanation. The fact that studies of parents have so far failed to demonstrate a negative correlation between parent intelligence and number of offspring is also in line with the third hypothesis outlined above.

Vernon (101) recommends the study of orphans and foster children whose true parents' families differ in size. Such an approach would undoubtedly provide crucial data on the first hypothesis, which concerns the contribution of hereditary factors to the obtained correlation between intelligence and family size. This type of investigation, however, would be handicapped by a number of practical obstacles, such as the limitation in number and range of available cases. Moreover, family size of orphans and foster children would be artificially restricted by such conditions as death of one or both parents and illegitimacy.

A study specifically designed to test the role of parental contact in the association of intelligence and family size is reported by Nisbet (70). Three types of data were obtained on several thousand Aberdeen school children. First, in the effort to gauge the contribution of language development to the correlation between family size and performance on the ordinary verbal type of intelligence test, the latter correlation was found with language test scores partialled out. This partial correlation was only -.04. When family size and language test score were correlated, with intelligence test score partialled out, a higher correlation (-.11) was obtained. Secondly, verbal intelligence tests yielded significantly higher correlations with family size than did such nonverbal tests as Raven's Progressive Matrices. Thirdly, the correlation between intelligence test scores and family size rose with age. The latter finding was obtained in both a cross-sectional study of the large sample and a longitudinal study of a small sample. From the results of these three approaches, Nisbet concluded that part of the negative correlation between family size and intelligence may be attributed to the effect of sibship size upon verbal development, which in turn influences intellectual development.
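
The operation of "partialling out" a third variable can be sketched with the standard first-order partial correlation formula. The zero-order correlations below are hypothetical values chosen for illustration; they are not Nisbet's data, and the function name is likewise an assumption.

```python
from math import sqrt


def partial_corr(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant.

    Uses r_xy.z = (r_xy - r_xz * r_yz) / sqrt((1 - r_xz**2) * (1 - r_yz**2)).
    """
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Hypothetical zero-order correlations (not taken from any cited study).
r_size_iq = -0.30     # family size vs. verbal intelligence test score
r_size_lang = -0.33   # family size vs. language test score
r_iq_lang = 0.80      # intelligence test score vs. language test score

# Family size vs. intelligence, language test score partialled out.
size_iq_given_lang = partial_corr(r_size_iq, r_size_lang, r_iq_lang)

# Family size vs. language, intelligence test score partialled out.
size_lang_given_iq = partial_corr(r_size_lang, r_size_iq, r_iq_lang)

print(round(size_iq_given_lang, 3), round(size_lang_given_iq, 3))
```

When the language test carries most of the association with family size, the first partial correlation shrinks toward zero while the second remains appreciably negative, which is the pattern of reasoning described in the preceding paragraph.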
In a later study on 288 adult women, Scott and Nisbet (80) again found that the negative correlation between size of sibship and intelligence test score is lower for nonverbal than for verbal tests. Such results corroborate Nisbet's findings on the child sample described above, and suggest that the environmental factors associated with family size may affect test performance well into adulthood.

It should be noted that the psychological effect of family size upon intellectual development may be curvilinear rather than rectilinear. Moreover, the direction in which increasing family size influences intellectual development may itself differ with concomitant psychological and social circumstances. Attention should likewise be given to the possible effects of family size upon nonintellectual characteristics, such as socialization, cooperation, emotional security, leadership, and other aspects of interpersonal relations. Available research on this question has yielded conflicting and ambiguous results (9, 10, 11, 26, 27, 32, 55, 57, 86). Some studies provide evidence that certain desirable emotional characteristics, as well as acceptance by associates, may be positively related to membership in relatively large families (9, 11, 27). Most of the results, however, are complicated by the correlation between indices of social adjustment, on the one hand, and both intellectual and socioeconomic level, on the other. Differences in family size among ethnic subgroups within a sample are another source of confusion in such data.

In recognition of the methodological and interpretative complexities of the problem, there is a growing interest in the design of more nearly definitive investigations on the relationship between intellectual level and fertility. Ideally, such investigations should begin with the testing of young people prior to their educational and vocational differentiation, i.e., after all have completed a uniform period of required schooling. Preferably the test should consist of a differential aptitude battery yielding a profile of scores rather than a single global measure. The subjects should be followed up until all or nearly all of their families are completed. Age of both parents at the birth of their first and last child should be recorded. Data should also be kept on deaths, unmarried persons, and childless marriages. Information should likewise be gathered regarding occupation, income level, and amount of subsequent education for each member of the group. It would also be of interest to obtain indices of social mobility, such as changes in occupational, educational, or income level within the subject's own life, as well as differences between his status and that of his parents.

Additional questions can be answered if test scores of the children of these persons are studied, but many problems can be solved even prior to this step. From a practical viewpoint, such a program is not unrealistic, especially in nations where uniform psychological tests are administered in the school system and where detailed census data are regularly collected. From a theoretical standpoint, this approach would help to separate the many interrelated variables which are now intricately intertwined, and should thus bring us closer to a causal interpretation of the empirically observed relation between intelligence and family size.

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STUDIES OF BRAIN 
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In 1924, Charles Dunlap (24) 
stated quite unequivocally that there were no organic lesions or histopathological findings which could be shown to be characteristic of schizophrenia. According to him the changes which had been reported were not essentially different from nerve cell changes found in nonpsychotic persons. This rather sweeping conclusion was based on his comparison of the brains of eight schizophrenic patients with those of four “control” subjects. His control group consisted of a bootlegger who had been shot through the lung and died shortly thereafter, two men who died of arsenic poisoning, and one woman who died of peritonitis. It appears, contrary to Dunlap's statement, that there is room to question the “normality” of such a group. At any rate, this pronouncement apparently had an inhibiting effect on further investigations into the role of organic factors in schizophrenia. Few such investigations were undertaken in the decade following Dunlap's authoritative statement. In 1930, Freeman (33) quoted Dunlap's conclusions but was willing to report finding a deficiency of catalytic iron in the nerve cells of schizophrenics. Here again seems to be evidence of the profound effect of a statement backed by an authority figure which blocked progress in an area deserving of continuing experimental study.

Perhaps the electroencephalographers were unimpressed by authority or were not aware of it. At any rate, they appear to have been, in large part, responsible for rearousing interest in the search for an organic factor in schizophrenia. In the process of recording the brain waves of various groups of people, they found among schizophrenics waves resembling those found in cases of known brain damage. Histopathologists and biochemists have followed up these leads so that the recent psychiatric literature presents a number of provocative studies. It is the purpose of this review to present some of these recent findings and to suggest some hypotheses.

1 From the Veterans Administration Hospital, Palo Alto, California.

2 I would like to thank Mary Ann Swanson and Elizabeth Grisak for their helpful assistance.

While it is probably true that most of the work in the physiology, genetics, and psychology of psychoses is at least indirectly related to the subject of brain dysfunction, the electroencephalographic, histopathological, and biochemical studies are both directly relevant and of sufficient importance in themselves to merit concentrated attention. The review will include these studies as well as several psychological and physiological studies which focus specifically on the question of brain disorder in schizophrenia. Studies made within the last twenty to twenty-five years will be included.

It is felt that psychologists could make a worth-while contribution to the studies of possible central nervous system pathology among schizophrenics. Measurement of organic damage by various kinds of psychological tests has been rewarding to the extent that tests have been shown to be useful in research and in theoretical formulations about the nature of thinking in cases of cerebral damage. In many clinics and hospitals neurologists have come to depend on psychological tests in the diagnostic study of central nervous system involvement. It is hoped that this review will stimulate research interest on the part of psychologists who are concerned with the physiological correlates of behavior.

EEG STUDIES

In 1937, Travis and Malamud (76) reported a study unfavorable to the hypothesis that schizophrenics show patterns different from the normal. These investigators were apparently only secondarily interested in this possibility since they were more interested in testing some of Berger's original formulations about the nature of the EEG. However, they did find no differences in either amplitude or frequency of the waves of nine schizophrenics compared with those of a group of seventy-five normals and a group of stutterers. They were aware of the small size of their group of schizophrenics, and are tentative in their conclusions.

In the following year, Gibbs, Gibbs, and Lennox (34) found that epilepsy is present about three times as often among schizophrenics as in the general population. Gibbs et al. point out that their experience shows that the majority of schizophrenics they have tested show abnormalities of rhythm. They found both fast and slow EEG patterns. Although they do not present statistics, they were impressed by the similarity of many of the records of schizophrenics to those of psychomotor epilepsy. This finding should arouse some interest in the possibility of an organic factor in those cases in which an epileptic EEG is found. The meaningfulness of such a question would, however, depend upon the acceptance of an organic basis for psychomotor seizures.

Jasper, Fitzpatrick, and Solomon (48) studied the EEG's of 82 schizophrenics, so diagnosed by agreement of five psychiatrists (no small accomplishment in itself). They also included a group of 51 epileptics and 60 normal subjects. They found, as is expected of schizophrenic records by now, no specific EEG pattern, but rather, a wide range of variability. Contrary to the study of Travis and Malamud, they found a variety of anomalies to a more extreme degree and in a greater number among schizophrenics than normal subjects. Specifically, they report that 23 per cent of the cases diagnosed as schizophrenic showed either clinical or electroencephalographic evidence of brain activity similar to that characterizing the epileptic. They are somewhat hesitant in their conclusions about the role of organic factors in producing the abnormal EEG's. They state, “15% of our group of patients diagnosed as schizophrenic may well be suffering from mental disorders due to head injury, mental deficiency, or other organic defects of the brain” (48, p. 849).

Another study involving 52 chronic schizophrenics and 500 control cases reports essentially the same results. Davis and Davis (19) also feel that schizophrenic patients do not have a typical “schizophrenic EEG” in the same sense as, for example, the spike and slow wave of petit mal epilepsy. They found variations in wave patterns outside the normal range of subjects. They come to much the same conclusions as Jasper et al., but are more cautious in their concluding statement. “The quality of the atypical variations resembles that of the changes which occur in the electroencephalograms of persons known to have brain lesions or who suffer from the various forms of epilepsy, or normal persons during sleep, or when breathing an inadequate supply of oxygen” (19, p. 1019). They permit the reader to draw his own conclusions about the meaning of their findings in regard to possible organic factors in some schizophrenics.

In 1940, Rubin (70) published EEG studies of 14 schizophrenics, 2 manics, and 1 case of traumatic psychosis. He used the per cent time alpha as a measure. By studying the distribution of per cent time alpha over the various areas of the cortex, he felt that he was able to detect the presence of atrophy in these cases. His diagnosis was confirmed in 10 of the cases by pneumoencephalograms. Air studies had not been done on the remaining subjects. Rubin states not only that there was central nervous system pathology present, but also that the nature of this pathology was atrophy. Rubin's results were confirmed by Moore, Nathan, Elliott, and Laubach (55). In the majority of their sixty cases they found evidence of atrophy, enlargement of the ventricular system, and cisternae. A study of the films suggested that there was a selective atrophy involving the parietal lobe and insula. Unfortunately they do not report on the frequency of occurrence. It could be inferred that at least 31 of the cases showed such atrophy. Again it is difficult to assess these findings since the authors used no control group and hence could not evaluate the frequency of atrophy among nonschizophrenics.

Davis (20) studied the EEG's of 132 schizophrenics in an attempt to work out a classification system for schizophrenic EEG's. She found that she could isolate three general types of patterns. Group I showed essentially no differences from the normal. Group II showed dysrhythmias without convulsive characteristics. Group III contained records which suggested the possibility of localized or generalized irritative lesions, cysts, atrophy, or other pathological conditions. This study again suggests that an appreciable number of schizophrenics show electrical activity of the cerebral cortex which has long been associated with some form of structural brain damage.

Finley and Campbell (31) studied the EEG's of 500 schizophrenics and 250 normals using a triple classification similar to that of the previously mentioned investigator. They categorized the records as normal, borderline, or abnormal. When these classifications were applied to their two groups they found 40 per cent of the schizophrenics had normal records while 70 per cent of the normal group had normal records. [...]-eight per cent of the schizophrenics had abnormal records while 7 per cent of the normals had abnormal records. The remaining subjects had records in the borderline category. Thus the incidence of abnormal tracings among schizophrenics was twice that of the normal group. Finley and Campbell found that 30 per cent of the patients' records showed tracings indicative of a disorder of function in the central nervous system. They point out, however, that the EEG pattern is not diagnostic of schizophrenics since the kinds of wave activity found are not unique to this group.

Kennard and Levy (50) found exactly the same 60 per cent of abnormal patterns among 100 schizophrenic patients as reported by Finley and Campbell. They looked for correlations between the abnormal patterns and the patients' clinical pictures. The incidence of abnormal EEG's was higher for those patients with early onset and prolonged illness. If the illness had lasted three years or less, the incidence of abnormal tracings was only 42 per cent as compared to 72 per cent if the illness had persisted over ten years. This relationship also seemed to hold in regard to the degree of intellectual impairment shown by the Wechsler-Bellevue Test. It is suggested that these findings are consistent with the [...] that schizophrenia “can be thought of as a progressive disorder which ultimately profoundly affects the performance of [...]nic systems including that of [...] cortex” (50, p. 421).

Grinker and Serota (35) implicate the hypothalamus in their EEG study of schizophrenia. Their report is in the nature of an hypothesis resulting from the observation of some autonomic reaction in schizophrenia. There was little electrical activity in the hypothalamus and cortex when their patients were subjected to external cold. They also found that electrical stimulation of the hypothalamus failed to arouse activity in that structure or in the cortex. Among neurotics and normals an increase in activity was apparent. They suggest that these findings indicate a physiological failure at the cortical level because the usual hypothalamic driving force is lacking. The inference could be made from these formulations that the thalamocortical circuits are not functioning so that the cortex is deprived of impulses from lower centers. The Tulane group headed by Heath (38) has also suggested that subcortical centers may be implicated in schizophrenia. They report spiking activity from areas near the thalamus in the septal region. EEG's from leads implanted by the Horsley-Clarke apparatus in subcortical areas of schizophrenic patients often showed the spiking activity generally believed indicative of structural damage. They apparently felt that if they succeeded in eliminating the spike by administering electric shock through the subcortical electrodes, the patient had a better chance for recovery than if the spike activity continued.

One other EEG study is of incidental interest. Turner and Lowinger (77) saw a marked difference in prognosis for recovery of schizophrenics given shock treatment, depending on the preshock EEG pattern. None of the patients with abnormal preshock waves was able to leave the hospital after treatment. One-fourth of those with normal patterns were discharged following the treatment. Again, a relationship between the presence of abnormal wave activity and malignancy is suggested.

In summary, it is interesting to note the progressive changes in the significance ascribed to EEG findings in the research on schizophrenia. The first studies emphasize lack of consistency in the tracings. In the later studies, greater confidence is placed in the idea that EEG studies have demonstrated brain dysfunction in an appreciable number of cases of schizophrenia, despite the variability in tracings. Finally, some investigators have been willing to state the nature of the difficulty, i.e., atrophy of the cortex and/or failure of thalamocortical circuits. The histopathologists have been much more specific in their statements regarding the nature of the brain pathology found. This larger group of studies will be reviewed next.

HISTOPATHOLOGICAL STUDIES 


One of the persistent proponents of organic factors in schizophrenia is Eugen Bleuler (12). His monograph on the schizophrenias is no doubt still the outstanding text in the field. It
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was his early clinical observations of 
sensory disorders among such pa- 
tients that led him to suggest organic 
dysfunction of the brain. In 1930, 
Bleuler wrote a provocative essay 
(13) in which he tried to sort the 
psychogenic from the physiogenic 
in schizophrenia. It seems worth- 
while to quote part of his conclusions 
in regard to the role of the central 
nervous system in schizophrenia: 
“Many cases of stupor, with their general prostration of the
elementary psychic functions, conception and train of thought, often
point clearly to brain pressure and, on autopsy tense edema of the pia
or brain swelling is found. In the various forms of such deliria, the
fundamental similarity of certain symptoms or of the whole picture to
other physiogenic conditions, intoxication, fever psychoses, epileptic
absences, meningitis, encephalitis, cannot be denied, and in all such
cases we also find in the autopsy histological alterations of the
brain tissue which show some uniformity. But in all chronic cases too,
decreases in the amount of ganglion cells and certain changes in the
glia, furnish proof that we are in the presence of a brain
lesion. . . .” (13, p. 206).


There are serious difficulties in experimental histopathology, many of
which have not been overcome. For instance, as Spielmeyer (74) points
out, it must be remembered in using necropsy material that the
diseases which cause death may themselves bring about changes in the
brain. Also, it is not known what the agonal process itself may do to
brain cells. Neither of these variables has been thoroughly explored.
It might be added that although biopsy material can now be obtained
easily at the time of prefrontal lobotomy, control material from
normal subjects is not so readily procured. If control material is not
obtained, it is impossible to assess the extent to which the results
may be attributed to the experimental procedure. Many of the studies
to be reviewed in this section have not overcome these difficulties
altogether. Hence, the degree of confidence to be placed in the
conclusions of these studies is necessarily limited, as is also the
generalizability of the results.

In 1930, Spielmeyer (74) recognized these potential sources of error,
but still felt justified in concluding that characteristic
histological changes were evident in schizophrenia. The finding he
emphasizes is the loss of cells in the cortex. He also reports changes
in glial cells and the presence of destruction products. He concludes
that there is sufficient evidence to state definitely that
schizophrenia is an organic process.

In 1932, Bamford and Bean (6) studied a series of cases they describe
as acute dementia praecox. These authors report on three cases but
base their conclusions on an unspecified number of additional cases.
Their conclusion is quite different from Spielmeyer's. In fact, this
study is one of the few since 1930 reporting completely negative
results. (It should be remembered, of course, that negative results
are less likely to be reported than are positive findings.)

As is often found in the psychiatric literature, there are a number of
case studies reported. Several of these will be reviewed even though
they do not represent well-controlled or truly experimental studies.
It is perhaps not necessary to state that fruitful hypotheses may be
derived from observation of the single case.

Ferraro (29) reports two cases diagnosed dementia praecox. He found
swelling of the oligodendroglia in both cortical and subcortical
tissue, but enlargement was most prominent in frontal and temporal
areas. He also noted demyelinization, perivascular infiltration, and
the presence of gitter cells. These were taken to be indications of a
degenerative process not to be expected in the normal brain. Ebaugh,
Barnacle, and Neuberger (25) detected histological alterations in
fatalities resulting from
electroshock therapy. They found changes in astrocytes and in the
neuroglia, and suggest that although these modifications may have been
related to the shock they were not limited to areas of strongest
current. Their conclusions are not explicitly stated, but they imply
that shock alone could not account for the brain damage. Ferraro (30)
reports another case which had been diagnosed as schizophrenia for
more than ten years. The clinical picture was clearly that of a
schizophrenic psychosis, but on autopsy a diffuse demyelination of the
brain was found. The disease was apparently so widespread and
progressive that it could scarcely be attributed to a sudden illness.
Polatin, Eisenstein, and Barrera (66) describe two typical cases of
chronic schizophrenia, similar to those of Ferraro. Their discovery of
slow wave activity in the EEG's was the first clue to the presence of
organic damage; pneumoencephalography revealed cerebral atrophy; and
the Rorschach suggested organic brain disease. Biopsy studies revealed
what the authors considered to be irreversible changes in the cerebral
cortex.

Roizin, Moriarty, and Weil (68) examine the case of a 34-year-old
woman who died after a brief acute psychosis diagnosed as catatonic
schizophrenia. Here, as in Ferraro's research, study of the brain
tissue revealed a demyelinating process in the central nervous system.
These authors interpret their findings not as indicating that
schizophrenia is an organic process but that impaired brain function
acts as a precipitating factor for a schizophrenic reaction.

Holt and Tedeschi (45) and Van der Horst (78) report, as did the
previous investigators, the presence in individual cases of
demyelinization, microglial and oligodendroglial hyperplasia, loss of
cells, and vascular infiltration. Additionally, Holt and Tedeschi
detected cytoplasmic involvement in the nerve cells themselves along
with nuclear disturbances and lipoid deposits.

One of the most interesting and best controlled studies was made in
1944 by Kirschbaum and Heilbrunn (51). They took biopsies from the
frontal lobes of eleven chronic schizophrenics and also from three
normal subjects. Biopsies from several cats and rats provided further
control material. For the experimental material they report ganglion
cell degeneration as well as alterations in glial cells and blood
vessels. These changes are similar to those found in chronic
intoxication and metabolic disorders. Comparison of experimental and
control material indicated that the results could not be attributed to
the experimental procedures. This study seems most impressive because
many of the methodological flaws of other studies were avoided: The
tissue was taken from the living organism and studied immediately,
thus ruling out the complicating effects of the agonal process as well
as the disease which might cause death. Also, the use of controls in
this study held constant any effects the anesthetic and surgical
procedures might have had.


Another well-executed biopsy study is reported by Elvidge and Reed
(27). They used material from 19 psychotic patients, 13 of whom were
schizophrenic. Biopsies were also obtained from 16 control subjects,
mostly epileptics, and from 14 laboratory animals. They too report for
the schizophrenic subjects changes in glial cells, with hypertrophy
and swelling. In some cases the oligodendroglia showed nuclear
modification. These changes were most apparent in the white matter; in
some cases they were generalized, while in other cases they appeared
more patchy. Similar changes were found for those control cases in
which the patient showed clouding of consciousness between epileptic
seizures, but not for those cases in which the patient remained clear
and oriented between seizures. A number of the psychotics were
followed over periods of from one to two years, and the biopsies were
repeated. The findings were the same as before. It is unfortunate that
none of the cases had remitted between biopsies, for if it could have
been demonstrated that the changes had slowed down or ceased in
recovered cases, it would have been an impressive argument for the
importance of organic factors in schizophrenia. Elvidge and Reed
suggest that their results indicate disturbances or interruptions of
commissural pathways in the brain, and that it is this “disruption” of
impulses to and from various parts of the brain which causes
disturbances in intellectual and emotional reactions.

Winkelman and Book (80) made a post-mortem study of 10 cases of
“typical” schizophrenia. Their findings are listed as (a) focal cell
loss, (b) a general decrease in the number of nerve cells, especially
in the anterior half of the brain, (c) chronic cell diseases,
including cell shrinkage and ghost cells, (d) amount of fat in cells
usually greater than expected, (e) increase in astroglia, and (f) mild
general demyelinization in subcortex. They claim that the changes
found in the brains of their patients were much more severe
quantitatively than those usually found in nonpsychotic cases.

From another post-mortem study come similar results. Rupp and Wilson
(71) studied 37 patients, all under fifty years of age. Biopsies
revealed loss of cells, gliosis, atrophy, edema, vascular lesions, and
areas of softening. The authors do not report data for control
subjects.



Probably the most persistent investigators in the histopathology of
schizophrenia are Papez, and Papez and Bateman (57, 58, 59, 60, 61,
62, 63). Their methods and techniques are carefully detailed and
appear to meet the standards of good research, with the exception that
the extent to which they have studied control material is not always
made explicit. In 1944, Papez (57) reported finding dark staining
particles, inclusion bodies, in the cytoplasm of nerve cells.
According to him, these particles were not artifacts produced by the
technique but were living microorganisms of a parasitic nature. In
1949, Papez and Bateman (59) made another study of biopsies. In their
sample of 50 patients, there were 42 schizophrenics, 6 manic
depressives, and 2 syphilitics. They identified three stages of cell
disease. In the first stage, the cells showed intact cytoplasm
occupied by a small number of the inclusion bodies. Cell nuclei showed
some deformity. The second stage revealed highly vesicular nuclei,
enlarged nucleoli, and cytoplasm filled with inclusion bodies. The
third stage was characterized by naked nuclei stripped of cytoplasm.
The inclusion bodies had apparently invaded the walls of small blood
vessels, producing degenerative changes. The authors studied these
inclusion bodies under a dark phase microscope and concluded that they
were dealing with a living organism. They were able to watch these
zooid organisms in various phases of the life cycle and to keep them
alive over a period of many hours in a suspension of cortical
material. Drawings made from microscope slides are presented in the
article. They indeed suggest an ominous picture of parasitic
infestation. Papez and Bateman propose and argue convincingly for the
thesis that the histopathological changes reported by other
investigators are associated with various phases in the life cycle of
the organism.

Schaderwald (72) confirmed many of the findings of the previous
investigators, using material from 34 biopsies taken at the time of
prefrontal lobotomy. He states that his data is in complete agreement
with that reported by Papez, and Papez and Bateman. However, because
of the lack of controls, he is much more cautious about ascribing
pathognomonic significance to the findings. The reviewer must agree
with Weinstein (79) in discussing the studies of Papez, and Papez and
Bateman: “Merely the unusual nature of the claims must not be allowed
to interfere with open-minded considerations of the findings. . . .
Many questions remain which can be answered only through carefully
designed experimental work with adequate attention to controlled
observations . . .” (79, p. 549).

The thalamus and autonomic nervous system have also been implicated as
sites of possible lesions in schizophrenia. In 1939, Stein and Ziegler
(75) reported a biometric analysis of 24 patients and 1 normal
subject. The brains and thalami in the schizophrenic patients were
smaller and the cell count lower than for manic-depressive patients in
general. However, none of the differences was statistically
significant. The investigators felt that the small size of their
sample contributed to the negative findings.

For a group of 16 paranoid schizophrenics, Bateman and Papez (7) found
inclusion bodies in thalamic cells as well as cell loss as high as 90
per cent in the association nuclei. On this finding they based their
proposition that hallucinations and delusions are the result of
abnormal nervous discharge of the thalamus to the cortex. Since
sensory input from peripheral sense organs must pass through the
thalamus, the presence of diseased cells there could distort the
impulses to such an extent that the misperceptions characteristic of
psychoses would be expected.



Two reports question the importance of histopathological findings in
schizophrenia. In 1949, Rowland and Mettler (69) studied 22
schizophrenics and 1 manic patient. They hypothesized that if cell
loss is actually an important feature in psychosis, it should be
positively correlated with severity of illness. However, their
findings indicated no difference in cell count between a group
hospitalized more than twenty-two months and a group hospitalized less
than twenty-two months. They contend that the various methods of
counting cells are frequently of questionable reliability, and suggest
that this fact in large measure accounts for the conflicting reports
of cell loss. The same opinion is voiced by Wolf and Cowan (81). They
add that perhaps the positive results which have been reported can be
explained on the basis of misjudgment of the limits of normal
variation in brain tissue, and on the basis of methodological
artifacts. They feel that available studies are not sufficient to
permit accepting the thesis of a specific organic factor in
schizophrenia. This position is vigorously opposed by Winkelman in a
discussion following their article.

In summary, it can be stated that 
a number of different pathologists 
report similar results. A frequent 
finding is loss of nerve cells. Pneumo- 
encephalographic reports of atrophy 
appear to support this impression. 
Other repeated findings are: the pres- 
ence of a demyelinizing process, de- 
generating nerve cells, and patho- 
logical changes in the various glial 
cells. It seems fairly well established 
that the brains of some schizophrenics 
show structural damage, but the sig- 
nificance of the damage is difficult to 
evaluate because comparable studies 
of nonpsychotic subjects are generally 
lacking. 


BIOCHEMICAL STUDIES 


So far, very few biochemical studies of brain function in psychosis
have been made. In 1930, Freeman (33), after agreeing with Dunlap that
there were no histopathological findings in schizophrenia, reported a
deficiency of catalytic iron in the brain cells of schizophrenics. His
sample included 35 schizophrenics and 16 patients of other diagnoses.
Freeman suggests that the lack of this catalytic agent (which is
necessary for the use of oxygen) may underlie some of the symptoms. An
additional inference that could be made from these findings is that
the inability of the cells to take up oxygen would eventuate in their
degeneration.

Doust (23) used two samples of schizophrenics, one English and one
American, totaling 87 patients. He studied the brains by means of the
spectroscope and photoelectric oximetry, and found marked reduction in
the effective capillary oxyhemoglobin. In view of the extreme
sensitivity of nervous tissue to anoxia and the irreversibility of the
consequent damage, Doust's results seem quite important to the
question of central nervous system involvement in schizophrenia. Also,
it is interesting to note that this investigator reports a much more
profound deficiency in capillary oxyhemoglobin among the so-called
“process” schizophrenics (i.e., classical dementia praecox) than for
any other group studied. The reviewer feels that this finding has
considerable significance for methods of selecting groups of
schizophrenics for study. This point will be elaborated in the
discussion to follow.


PHYSIOLOGICAL STUDIES 


From England, Hoffer, Osmond, and Smythies (44) report the accidental
discovery of adrenochrome, or pink adrenaline. This substance, closely
related to adrenaline, is a deterioration product of that hormone.
Although chemically it is only slightly different from adrenaline, its
effect on the body is quite different. It shows toxic effects in much
the same way as does mescaline. For this reason, adrenochrome caught
the interest of investigators hypothesizing toxicity as a basis for
schizophrenia. (The catatonic subgroup has usually been the object of
such study, since they present many of the symptoms generally
indicative of toxic disorders.)

Hoffer, Osmond, and Smythies studied the effects of pink adrenaline on
a group of volunteer normal subjects. The results are striking. When
given adrenochrome, the subjects showed behavioral symptoms of
schizophrenia. Furthermore, their subjective reports confirmed the
observer's impressions that the experiences were delusional and
hallucinatory. EEG's taken on the subjects under the influence of the
drug showed arrhythmias and epileptic patterns within half an hour
from the time of administration. The authors hypothesize that when
adrenochrome enters the cerebral cells it inhibits carbohydrate
metabolism and interferes with cell respiration. In schizophrenics, an
excess of adrenochrome may be produced, resulting in autointoxication.
One might also speculate about a possible connection between an
excessive production of adrenochrome and the hypothesis that
schizophrenics suffer a chronic hyperactivity of the
sympathetico-adrenal system.

The endocrine system has been considered the site of possible
pathology in schizophrenia for well over half a century. Every gland
from sex to pituitary has at one time or another by various
investigators been identified as the locus of the difficulty. The
findings have been tantalizing but equivocal, and the work done in
this area would constitute a bibliography several times as long as the
present one. In the following, a few of the studies which explicitly
link endocrine and autonomic nervous system functioning will be
mentioned.

Hoskins (46), in reviewing the literature on endocrine function in
schizophrenics, reports some of the work done over a period of twenty
years at Worcester. His observations led to the conclusion that in an
appreciable number of cases thyroid dysfunction is a basic factor.
Some schizophrenic patients seem to suffer from a lack of thyroid
secretion; however, continued treatment with the hormone raised the
question of whether the target tissue was not at fault. That is, since
some patients did not respond even to massive doses of thyroxin,
Hoskins concluded that the difficulty is not so much in thyroid gland
production itself as it is in an apparent failure of the appropriate
receiving or “target” tissue to utilize the hormone.

The other glands which frequently show anomalous reactions are the
adrenals. The function of the adrenals is relevant to our subject, for
while there is no direct nervous control of the adrenal cortex, its
function is intimately related to that of the autonomic nervous
system. Indirectly, the secretions of the adrenal cortex are
influenced by the autonomic nervous system through the hypophysis,
which is controlled by the hypothalamus. In the past few years there
has appeared an impressive number of studies on adrenal function in
schizophrenia. No doubt the research on adrenal activity has been
accelerated because ACTH and cortisone have become available as
research tools.

Hoskins (46) suggests that in schizophrenia the adrenal-autonomic
mechanism is somehow disturbed, probably ultimately expressing itself
in altered metabolism. Work on the adrenals and the emergency
functions has been followed up by Hoagland and others (17, 39, 40, 41,
42). These studies are typical of the many reported. The primary
finding appears to be that the schizophrenic exhibits a
physiologically subnormal response to stress. Altschule, Promisel, and
Parkhurst (3) found that the reactions of schizophrenics to ACTH
injections are not reduced, so they propose that the difficulty lies
in the hypothalamus. Perhaps this vital center fails to activate the
pituitary to secrete sufficient adrenocorticotropic hormone, resulting
in a reduction of the usual adrenal stress reaction. Hoagland et al.
(39) support this point of view.

After a rather thorough review of the literature on the bodily
functions in psychoses, Altschule (2) concludes that most of the
disordered physiology reported is a consequence of the psychosis
rather than an etiological factor. He makes one exception, namely that
disturbances in adrenocortical function may cause metabolic
disturbances which interfere with brain function, producing psychoses.
He emphasizes that there has been a general tendency to ignore the
physiology of mental illness. This tendency has been reinforced by the
continued use of the term “functional,” which implies, wrongly, a lack
of physical or biological imbalance. Actually, there is no such thing
as a functional psychosis. Altschule states in regard to this point:
“Theories that ignore cerebral metabolic processes and regard
psychosis as merely a state of mind, deliberately chosen by the
patient because of environmental influences, have nothing, not even
reasonableness, to support them” (2, p. 212).

Paralleling the endocrine studies is a series of experiments with
various drugs which produce “model-psychoses” (32). Studies by DeJong
(21) with mescaline, bulbocapnine, and other catalepsy-producing drugs
led him to hypothesize a “faulty detoxification theory” of catatonia.
According to him, the liver is the organ at fault.

Since DeJong's studies, numerous other drugs have been used in an
attempt to find a toxic agent responsible for psychosis. Of these, one
which has recently received considerable attention is lysergic acid
diethylamide (LSD-25) (22, 32, 43, 47, 52). This drug seems to act on
the hypothalamus and, in normals, produces many of the psychological
effects characteristic of the schizophrenic psychoses. The drug has
also been administered to schizophrenics, and the reported effects
vary. In general, however, the effect seems to be one of exacerbating
existing symptoms. For normals, these effects are reported: visual
disturbances, thought disorders, mood changes, paranoid tendencies,
auditory phenomena, etc. Physiological changes are also noted. These
changes are principally those associated with autonomic activity. This
leads to the hypothesis that it is the homeostatic balance which is
disturbed by the drug (67).

On the negative side, Sloane and 
Doust (73) interpret their results as 
indicating that schizophrenics show 
a diminished or unchanged auto- 
nomic reactivity following adminis- 
tration of LSD-25. Their sample in- 
cluded 19 patients and 14 controls. 
In an attempt to clarify the specific 
action of the drug, Mayer-Gross, 
McAdam, and Walker (52) found 
that it produced changes in phos- 
phate metabolism. They propose 
that the apparent increased respira- 
tion of brain tissue may account for 
the psychological disturbances pro- 
duced by the drug. 

It is important to note that the studies last reviewed are also of
methodological interest, for when a disease can be artificially
produced it is an important step in the direction of control of that
disease. The chemist can understand his compound completely, not when
he has succeeded in analyzing it but only after he can synthesize it
in his laboratory. He is then in a position to control it and predict
its action. It is evident, of course, that the reactions produced by
the administration of a drug to normal persons cannot at this stage be
unequivocally equated with schizophrenic psychoses. For that matter,
as is pointed out by Pennes (64), the reactions to LSD-25 cannot be
entirely due to the specific action of the drug; otherwise, the wide
variability in its effects would not have been found. He too found
that not all of his 55 schizophrenic subjects reacted to the drug in
the same way. Actually, in some cases the drug seemed to have a
temporary normalizing effect. The most judicious conclusion is
probably that response to the drug is the result of interaction
between psychological and physiological factors.

In spite of the large amount of work done with various drugs, the
mechanism of their action is still not well understood. Nevertheless,
more and more of them are being discovered and tried out. Some, as
reported by the press, have assumed the status of wonder drugs.
Reserpine (54) and chlorpromazine are two such. Again, the
effectiveness of these drugs seems primarily ascribed to their action
on the autonomic nervous system (36).


PSYCHOLOGICAL STUDIES 


It is rather disconcerting to find that schizophrenic patients often
respond to psychological tests in ways that we have learned to
consider characteristic of patients with known organic involvement.
This presents a difficult problem to the clinician who is asked to
make a differential diagnosis. In the past, test constructors have
been busy trying to show how well their tests differentiate the
current nosological groups. Little attention has been paid to the
degree of overlap, even though the similarities between two groups on
test performance may be actually more impressive than the differences.
One exception is the report of Hanfmann and Kasanin (37) in the
discussion of their widely known test of conceptual thinking. They
found a small but significant mean difference between groups of known
organics and schizophrenics. Regarding the response similarity of
schizophrenics to organics, they state, “Cases of this type often bear
a close resemblance to cases of irreversible brain disease, except for
a greater variability and a certain imaginative quality of their
productions which are lacking in organic cases” (37, p. 75). It might
be added that this imaginative quality is often very difficult to
detect. Altrocchi and Rosenberg (1) also found overlap of
schizophrenics and organics in their work with a new block test of
conceptual thinking. Some of the so-called functional psychotics could
not be differentiated from known organic cases on the basis of the
test, although others could. The hypothesis is immediately suggested
that among the heterogeneous group of mental ills called
“schizophrenia,” there is a considerable percentage suffering chronic
brain damage.

In an attempt to test this hypothesis, Brackbill and Fine (14) studied
three groups of patients: a group of cases of known organic damage; a
group of cases of classical or process schizophrenia, selected on the
basis suggested by Kantor, Wallner, and Winder (49); and a group of
reactive or acute cases. Using Piotrowski's (65) Rorschach signs of
organic involvement, they found that the organic and process groups
could not be distinguished from each other, while the reactive group
showed significantly fewer organic signs than did either of the other
two groups of patients. This study provides additional support for the
thesis that among schizophrenics there is an appreciable number who
suffer central nervous system pathology.

Another approach illustrative of the type of contribution which could
be made by psychologists is that of Meadow and Funkenstein (53). They
classified a group of 58 schizophrenic patients into three subgroups
on the basis of autonomic reactivity. Group A had an autonomic pattern
indicative of some kind of “release” from cortical inhibition,
suggesting a breakdown in thalamo-cortical or cortico-thalamic
interaction. On tests of abstract thinking these patients showed the
severe impairment usually associated with brain damage. The general
behavior of these patients was of the nature which typically indicates
poor prognosis. On the other extreme were the symptomatically
schizophrenic patients of group B, who showed no disturbance in
autonomic reactivity and no disturbance in abstract thinking, little
anxiety, and no organized delusions. Prognosis was guarded, but was
better than for group A. The C group showed severe anxiety, a
different pattern of autonomic balance, and the best prognosis. It
could be hypothesized that group A represented the classical or
typical schizophrenic with organic symptoms, while the other two
groups represented reactive or situational disturbances.


HYPOTHESES AND SUGGESTIONS 


The various investigators in this area have suggested a number of
ideas and concepts in regard to the meaningfulness of these findings.
Eickhoff (26) and Anderson (4) propose an organic factor in childhood
schizophrenia. The former summarizes her observations as follows:



“I am therefore, postulating that schizophrenia in childhood is an
arrest in the development of thought and emotional maturity at an
infant or toddler level; that this arrest [...] basically upon a
defect in the acquisition of general sensation and this is due either
to a defect in the [...]logical systems concerned with pain,
temperature, vibration sense [...] or to faulty stimulation [...] or
both; and this defect leads to a delay in the formation [...] other
images” (26, p. 234).

Much the same statement is made 
by Anderson as the result of her ob- 
servation of schizophrenic children: 

This outstanding modification in my conceptual thinking concerning
schizophrenia would be in seeing the development of these reactions as
being determined not by a rather ambiguous rejection behavior on the
part of the mother, or any other significant person, but as a failure
in interpersonal relations brought about primarily by a very specific
type of organic brain deficit in the child. The exact nature of the
physiologic pathology would probably be in the associational pathways
of the most superficial layers of the cortex. The deficit might be of
any degree from minimal to extensive (4, p. 37).



Lauretta Bender's (10, 11) long years of work with schizophrenic children have led her to a biological conception of schizophrenia. At times she speaks of it as an organic brain disease, proposing that there is an inherited weakness in the homeostatic mechanism. This weakness in autonomic function is such that the usual crises of physiological development may be sufficient to precipitate the illness. Follow-up studies of schizophrenic children showed that the symptoms closest to the biological were the best predictors of adult schizophrenia. This again bolsters the concept of a process schizophrenia, which starts early, is malignant, and is nonremitting.

All three of these investigators see evidence of brain damage even in these early cases of schizophrenia. It is interesting that the defect is suspected to be somewhere on the input side of the nervous system, for Bateman and Papez (7) also suggest a distortion on the input side, resulting from cell disease in the thalamus. Also, according to Nielsen (56), there is a kind of schizophrenia that is based on a diencephalic lesion of a specific nature. He believes that the disease may eventually reach the cortex late in the process, and proposes that the defect is probably inborn, producing a disorganization of neuronal patterns. He proposes to call this an “apsychotic schizophrenia.” Patients of this group are also called “ambulatory schizophrenics.” They are nomadic, make a borderline schizoid adjustment, and their contacts with hospital and clinic are unproductive of any change in the pattern of adjustment.


Bychowski (15, 16) speaks of schizophrenic thinking as similar to the kind of organic reactions described by Goldstein. After pointing out the concrete character and the similarity of thought processes for the two groups, he arrives at the hypothesis that there is a dynamic deficiency of cortical and subcortical organization in schizophrenia. He proposes that a certain group of schizophrenics suffer an agnosia and apraxia of thought based on cerebral deficit.

Bellak (8, 9), after reviewing some 3,500 papers on schizophrenia, has formulated what he calls a “multiple factor psychosomatic theory.” He maintains that the illness can be the result of a multiplicity of etiological factors. In one case it might be the result of liver pathology; in another, organic brain disease, etc. The symptomatology of schizophrenia may be a sort of “final common pathway” for the expression of illnesses produced by many different causes. This formulation is apparently an attempt to encompass the divergent, and at times confusing, findings in schizophrenia. In such a system, one could think of the behavioral symptomatology as the phenotype which is the expression of a number of different genotypical factors.

It is the reviewer's impression that the findings reported by these investigators merit more consideration by psychologists than they have so far received. It is probably true that the resolution of the question of an organic basis for schizophrenia rests with neuropathologists. There is, however, room for psychologists to make a contribution. It is widely accepted that the group now labeled “schizophrenic” is a heterogeneous one, often with little similarity among its members. One of the reasons for such conflicting research findings could well be this heterogeneity. That is, it might be hypothesized that some of the variability noted results from using samples of patients actually representing different populations. Therefore, it appears that one of the first steps in clarifying the problems of research in schizophrenia is to work out a more effective classification scheme than the four subgroup system now used. It is the reviewer's impression that the evidence is sufficient to postulate the existence of organic brain disease in some, but not all, schizophrenic patients. Those who show the classical signs of formal thinking disorder, such as concreteness, disconnected associations, etc., may have some kind of structural brain damage. On the other hand, those who show delusions, hallucinations, etc., but no thought disorder, may not have central nervous system pathology. Perhaps it is too early to make a definite statement regarding the exact nature and mechanisms of brain dysfunction, either etiologically or dynamically, but certainly there are a number of available theories of brain function which could be applied to test current hypotheses and to suggest further ones.


EXPERIMENTAL EVALUATIONS OF ROLE PLAYING¹
JOHN H. MANN 


Columbia University 


In recent years a number of articles have been written on the use of role playing in education, personality assessment, role training, and psychotherapy. Yet, in spite of the widespread popularity of role playing, comparatively few studies have been reported in which role playing was evaluated experimentally.

The present article reviews such 
experimental studies in an attempt 
to establish the degree to which pres- 
ent practice is justified by experi- 
mental evidence, and to indicate the 
problems toward which future ex- 
perimentation might most profitably 
be directed. 

A role-playing situation is here defined as a situation in which an individual is explicitly asked to take a role not normally his own, or if his own in a setting not normal for the enactment of the role.

Examples of taking a role not one’s 
own are typically found in assessment 
procedures. For example: 
You are a twelve-year-old boy without brothers and sisters. All year long you have been looking forward to having your first experience in boy scout camp which runs for two months this summer. Camp begins next week. You are starting to pack as your father enters the room (7, p. 374).

Taking one’s own role in an unusual setting typically occurs in the context of a procedure designed to produce personality change. In such situations the individual is asked to portray scenes from his own life before a group of other people. The individual is acting his own role but the setting is unusual. For example:

We would like to see how you normally handle a customer when he comes in your store. Suppose Mr. X plays one of your customers. He has just asked you the price of a certain article. What do you do?

¹ I am indebted to Dr. Edgar Borgatta and Dr. Leonard Cottrell of the Russell Sage Foundation for reading this manuscript and providing many helpful suggestions.


Studies of role playing can con- 
veniently be grouped in accordance 
with the two definitions considered 
above. That is, they can be consid- 
ered to treat role playing either as a 
method for assessing personality or 
as a method for producing personality 
and behavioral changes. This classi- 
fication will be adopted in the review 
which follows.²


ROLE PLAYING AS AN 
ASSESSMENT PROCEDURE 

Early work in this area was strictly qualitative. In 1945, for example, Moreno and Moreno (17) used role playing to study the social stereotypes of children. Other examples, summarized by Eaton (9), occurred during World War II. At this time use was made of role playing in the assessment and selection of personnel. The studies of the OSS (18) and the proposed standardized assessment procedures of Bronfenbrenner and Newcomb (7) are outstanding examples of the qualitative work done during this period. Interest in quantitative measurement did not occur in this area until experimenters became concerned with the reliability and validity of their measuring instruments.

² There have been a number of studies in which the experimenters have consciously played a role as part of the experimental procedure. These studies are not included in this review for two reasons. First, the experimental subjects themselves do not role-play. Second, role playing is used strictly as an experimental technique for fabricating reality, and is not itself the object of study.


Reliability 


Probably the first to test the reliability of ratings of role-playing performance were Rotter and Wickens (20). They asked observers to rate subjects placed in two standard role-playing situations on the single characteristic of “social aggressiveness.” During one situation the subject was observed by a panel of four observers. During the second situation he was observed by a different panel of observers. There were also three observers who watched his behavior in both situations. Under these conditions a mean interrater reliability coefficient of .71 was obtained for a panel observing a single situation (N of 48). The correlation between the mean ratings of the members of the two panels on the behavior of the 48 subjects over two situations was .55, while the mean ratings of the three observers who watched the subjects in both situations had a reliability coefficient of .77. The .55 correlation may be interpreted not only as rater reliability but as subject consistency from situation to situation. The larger .77 correlation may result in part from a halo effect.

Stanton and Litwak (22), in a study devoted to evolving a test of autonomy, asked five judges to record the number of times they heard examples of “nonideal” behavior on 11 tape recordings of role-playing situations.³ In this way they obtained a mean reliability coefficient of .90 (rho).

Moldawsky (16), also using tape recordings of role-playing situations, was able to obtain an interrater reliability coefficient of .89 for ratings of “rigidity” of behavior.

³ Stanton and Litwak equate nonideal behavior with lack of autonomy. This may be a dubious equation, but it does not affect the present discussion.

Kelly and Fiske (13), interested in predicting success in clinical psychology, asked judges to rate subjects in role-playing situations on characteristics which were unrelated to the immediate role-playing enactment. Among these characteristics were research ability, therapeutic ability, diagnostic ability, and over-all suitability for the profession of clinical psychology. Interjudge reliability for these characteristics was found to be .45, .44, .40, and .51 respectively. In this study, however, not only was the reliability of the observer's judgment being measured but also the degree of similarity between his conception of the abilities required for successful performance in research, therapy, and diagnosis and that of the other judges. In the single case in which the individual judges’ ratings were averaged and corrected by the Spearman-Brown formula, the reliability for the given variable rose from .51 to .76.
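
The Spearman-Brown correction mentioned above can be illustrated with a short sketch. The formula predicts the reliability of the mean of k parallel ratings from the reliability r of a single rating: r_k = k r / (1 + (k - 1) r). The text does not say how many judges were pooled; assuming, purely for illustration, that three judges were averaged, a single-judge value of .51 gives roughly .76, which agrees with the reported figure. The function name and the panel size of three are assumptions of this sketch, not details taken from Kelly and Fiske.

```python
def spearman_brown(r_single: float, k: float) -> float:
    """Predicted reliability of the mean of k parallel ratings,
    given the reliability r_single of one rating."""
    if k <= 0:
        raise ValueError("k must be positive")
    return k * r_single / (1.0 + (k - 1.0) * r_single)


if __name__ == "__main__":
    # Hypothetical panel size: the source does not state how many judges were pooled.
    assumed_judges = 3
    pooled = spearman_brown(0.51, assumed_judges)
    print(f"{pooled:.2f}")  # prints 0.76
```

With k = 1 the function returns the single-judge reliability unchanged, which is a quick check that the formula is wired correctly.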

Borgatta (2) has studied the reliability of “real” situations and role-playing situations. In his study the observation system (Bales Interaction Analysis) was held constant and the reliability of the categories in the system was measured within the types of situations. It was found that the kinds of behavior which an individual initiates and receives are about as consistent in role playing as in “actual” behavior.

Almost any system for observing human interaction which has been devised can be used for observing role playing. The reliability associated with such systems is therefore related to the present discussion, but here we report only the findings derived from interaction situations specifically labeled role playing. In summary, it can be said that the above studies tend to confirm the possibility of establishing reliable systems of observing role-playing behavior.


Studies of validity can be ordered in relation to the nearness of the testing situation to the validating situation. The following discussion is ordered according to this principle, starting with studies in which the two are adjacent and extending to studies in which the testing and validating situations are only tenuously related.

Borgatta (2, 3, 4, 5, 6) in a series of reports has described the results of a study in which subjects alternately role played and “actually” behaved in an experimental setting. The subjects of this experiment were assigned to 166 three-man groups and rotated in such a way as to ensure that each subject would have equal experience with an array of persons.

Perhaps the most striking finding of this study in relation to the present context was that, though the interaction profiles varied greatly between the role-playing and actual situation, the performance of group members remained remarkably constant in relation to each other. Specifically, the correlations over the two situations for total rate of interaction initiated and received, and for rate of positive emotional responses initiated and received, were .76, .66, .58, and .58 respectively. (A correlation of .17 was significant at the .05 level.) In addition, a close parallelism was found between role playing and actual behavior over all the Bales Interaction Categories except that of “asking for suggestions,” which was a null variable for the actual behavior.

Certain differences between role playing and actual behavior were noted, however. Role playing was characterized by lack of tension and inhibition, and by higher rates of asking for and giving opinion in comparison to the actual behavior, in which more emphasis was placed on neutrality of feeling and task orientation. It is also interesting to note that leadership self-ratings made by subjects, and leadership ratings made by their associates, appeared to correlate more highly with their performances in role playing than with their performances in the actual behavior situation.

In a further analysis of these data Borgatta (6) compared role playing and actual behavior with behavior on a paper-and-pencil projective test (Conversation Study). The test consisted of a leaflet in which ten plates with various three-man situations were drawn. The subject was asked to write down what each of the men was saying. This imagined conversation was then analyzed by the Bales Interaction Categories in the same manner as the role playing and the actual behavior. From these data a correlation of .3 was obtained between the total rate of response on the projective test and that of either the role playing or the actual behavior. Role playing and actual behavior, as previously mentioned, were highly correlated.

This finding was similar to one obtained in an earlier experiment by Borgatta (1). In that pilot experiment the comparison was made between a subject's response to the Rosenzweig P-F Study when given normally (in written form), when used as the basis for a series of role-playing incidents, and as the basis for an analogous series of apparently unplanned real-life incidents. In this earlier experiment, in which 78 subjects were utilized, the findings were not clear because of the low reliability associated with the scoring system which distributed 24 scores over 11 categories.

To clarify further the relationship 
among these three types of situations 
Borgatta (6) subjected his data from 
the later study to a factor analysis. 
Of five relevant factors found, two, 
task ability and emotional assertive- 
ness, showed close parallelism over 
role playing and actual behavior and 
some parallelism over the projective 
test. A third factor, military adjust- 
ment, showed some parallelism over 
role playing and actual behavior, 
with a suggestion of parallelism over 
the projective test. In a fourth factor, 
task supportiveness, parallelism was 
found over role playing and actual 
situation only. Finally in a fifth 
factor, emotional group supportiveness, 
no parallelism was found. 

In his work Borgatta measured actual and role-playing behavior which occurred in the same experimental setting. Stanton and Litwak (22) expanded the distance between these types of behaviors by studying the relationship between role playing occurring in an experimental setting and actual behavior occurring in a natural setting. They were interested in studying autonomy as an aspect of stress-tolerance. They used a test which consisted of three standard stress situations. Raters were asked to note the number of nonideal behaviors which occurred during the enactment. The validity of the test was determined by correlating the measure of autonomy so obtained using a group of foster parents as subjects with ratings of autonomy made by social caseworkers who had considerable contact with the foster parents. A correlation of .82 was obtained. A correlation of only .48 was found between the caseworker’s rating and the rating of the homefinder, who interviewed the foster parents in their home. In a small class of eight students, a correlation of .79 was found between ratings of autonomy determined from role playing and ratings made by close friends.

It is also interesting to note that while raters who saw only role-playing scenes referring to autonomy were able to make ratings that correlated .82 with caseworkers’ ratings, raters who saw other role-playing scenes in addition were able to make ratings which correlated only .40 with caseworkers’ ratings. This suggests that the additional information which raters received contaminated, rather than clarified, the rater’s perception of the role-playing situation.

Kelly and Fiske (13) attempted to use role-playing behavior as a basis for making predictions of future success in clinical psychology. In their large-scale assessment program they utilized a battery consisting of standardized paper-and-pencil tests, projective tests, interviews, written background material, and situational (role-playing) tests. Two kinds of validation are provided by these data. First, one can consider the correlation between the situational test rating and the over-all rating obtained by the use of the whole test battery which, in this case, is the validating criterion. It was found that observers seeing only the situational tests could make ratings as good as those made by other observers who read the subject’s background credentials and conducted a preliminary interview with him, or who had seen his background credentials, his objective and projective test results, and had read his autobiography. Second, one can consider the correlation between the situational test rating and the criteria utilized to establish later successful performance in clinical psychology. The correlation between the over-all score based on the test battery and the validation criterion was .34. The correlation between the situational tests and the validating criterion was .20. This is not impressive since an equivalent correlation was obtained by using either the credential file or a standard intelligence test score. This low correlation may reflect the fact that the role-playing situations used were unrelated to the validating criterion in content.

In summary, it can be said that there is some evidence which indicates that valid predictions of interpersonal behavior can be made from role-playing assessment procedures. The actual development of such procedures has, to date, been very slow and consequently the problems involved have hardly been explored.


ROLE PLAYING AS A METHOD FOR 
PRODUCING PERSONALITY 
CHANGE 


Role playing has been used to produce personality and behavioral change in a wide variety of settings varying from leadership and teacher training to psychotherapy with neurotics and psychotics. Comparatively little experimental work has been reported, though application has been extensive.

Harrow (10) studied the effect of role playing on 20 schizophrenics organized into two groups that met 25 times over a two-month period. A third group of schizophrenics was used as a control. Each individual was given the Rorschach, the MAPS, and a special role test before and after the role-playing series. Although statistically significant results were not obtained, a group of judges looking at the before-after data found evidence of the development of more realistic perception of and interest in the outside world as well as an increased ability to handle personal problems.

Jones and Peters (12) in a similar 
experiment studied the effect of role 
playing on a group of twelve schizo- 
phrenics and a group of eight lobot- 
omy patients. Another group of 
schizophrenics was used as a control. 
The role-playing sessions were held 
once a week for four months. Before- 
after evaluation was made using the 
Porteus Maze, the Mirror-Tracing 
Test, Gardner Behavior Chart, Ror- 
schach, and the Draw-A-Man Test. 
Again, although statistically signifi- 
cant results were not obtained, it was 
found that all tests except the Draw- 
A-Man Test indicated improvement. 

Sause (21) has investigated the effect of role playing on a group of 15 normal student teachers. At the beginning of each role-playing session the group members selected a problem related to student-teacher interaction as the focus of the session. They then each wrote down their solution to the problem. Following this they role-played the problem and again wrote down their solution. Finally they rated their satisfaction with the meeting. Four judges rating the written solutions before and after a given role-playing session were practically unanimous in finding an improvement in the solutions after role playing. The judges agreed with each other 80 per cent of the time. It was also found that the quality of the solutions reached before role playing improved as the sessions progressed. Eighty-seven per cent of the student teachers were rated as having better solutions before role playing at the twelfth meeting than at the third, an indication that a transfer of training may have occurred.

Maier (15), also working with normal groups, has tested the effect of a specialized form of role playing (Multiple Role Playing) in helping groups reach satisfactory solutions to common problems. Using this technique he found that 42 out of 44 groups reached a solution, and that only four per cent of the group members were dissatisfied with the solution reached.

Because of either small sample size, design, or lack of underlying relationship, none of the four preceding studies provide statistically significant results. In addition, two of them fail to utilize a control group. It is best, therefore, to consider them as exploratory in nature.

A more satisfactory study from the methodological viewpoint was that of Janis and King (11). In their design they utilized 90 college students divided into groups of three. Each student gave a short talk to the other two on an assigned topic. Each student, therefore, spoke once and listened twice. The topics were assigned in such a way that the student speaking presented a point of view contrary to his own as determined by a previous opinion questionnaire. At the end of each speaking session (i.e., role-playing session) the opinions of the listeners and the speaker were reassessed.

The results indicated that, for two of the speaking topics used, there was significantly more change of attitude in the speaker than in the listener. For the third topic the change was not significant, but the speakers expressed greater confidence in the sureness of their opinions than the listeners. It was also found that individuals who improvised most and who were satisfied with their performance as speakers underwent the greatest amount of opinion change. There are two difficulties in generalizing the results of this study. First, speaking on an assigned topic before two other people is a very limited form of role playing. It involved only a one-way interaction between the active speaker and the passive audience. The typical role-playing situation consists of at least two-way active interaction. Second, the opinions changed were unimportant. They dealt with the number of movie theaters that would exist in three years, the size of the probable meat supply in the following year, and the length of time that would be required to find a cure for the common cold. It is probable that none of these topics was of central importance to the students tested. If this had not been the case the students might have shown greater resistance to change.

The results obtained by Janis and King were replicated by Rosenberg (19) under less artificial conditions. Using three standard role-playing situations with three groups of 15 persons each, group members were rotated among three experimental positions: role player, identifiers (instructed to identify with the role players), and active observers (instructed to watch the action as objectively as possible). A special group of control observers was assigned simply to watch the action. The role-playing situation involved such topics as labor-management relations, guidance problems, etc. After each role-playing session a questionnaire was given to all group members. This was followed by a carefully observed nondirective group discussion about the role playing. Using data so gathered it was found that the role players were most susceptible to change. They were also found to be the most productive in the discussion after the role playing, to have certain emotional biases of judgment and perception, and a tendency to support their own actions in the role playing. The identifiers were found to be most critical, almost as productive as the role players during the discussion, and to have an awareness of the dynamics of the interaction process second only to the role player. The active observer was found to be uninvolved and therefore unemotional with reference to the enactment. He was as critical as the identifier but relatively unaware of the dynamics of the interaction process. Finally, the control observer was found to be indifferent, unproductive during the discussion, and relatively unaware of the interaction dynamics. An interesting implication of this study is that understanding of the dynamics of interaction processes is associated with involvement in those processes. It is also interesting to note that individual involvement was found to be a function of position assignment.
While Rosenberg was concerned with the differences induced by experimental position within the role-playing situation, she did not consider differences between individuals in the same position. These differences have been studied by Luszki (14) who compared observers who were good at understanding the interaction dynamics in a role-playing scene with those who were not. She found that good observers, as distinguished from poor ones, (a) were good at determining how others looked at themselves, (b) were good at determining how others looked at them, (c) were well adjusted, (d) evaluated themselves as others evaluated them, and (e) were consistently and favorably perceived by others. Further it was found that persons who were good judges of how others felt about them were able to identify most successfully with role players. Persons who were good judges of how others felt about themselves were found to be good at observing the actions of role players critically.

Brown (8) has explored the effect of the role-playing situation on the success of the role player's performance. He did this by asking his subjects to reverse roles with each other and having them evaluate how well the other person succeeded in portraying them. It was found that success was inversely related to the realism of the role-playing situation. The more unusual the situation in which the role was portrayed, the better was the portrayal.

In summary, the findings in this area are sketchy and essentially suggestive in nature. The evidence indicates that role playing may produce behavioral and personality change, but this has certainly not been established with any degree of confidence. The evidence also suggests that the effect of role playing on the individual is related to his personality and to his position in the role-playing situation. In addition, the success of the role player may be related to the setting of action in the role-playing situation.


SUMMARY 


Perhaps the most striking impression to be gained from a review of the experimental studies of role playing is their scarcity. Except for the few studies reported here, effort among practitioners of role playing seems to have been devoted to the exploitation of its various applications.

With reference to personality assessment, there is some sound evidence for believing that reliable and valid role-playing tests can be developed. Such tests are, however, still in their infancy, and it is toward the development of these tests that future effort might profitably be directed.

With reference to personality
change, it can only be said that there
is as yet little supportive evidence.
The studies which do exist suggest
the possibility of such change and in-
It is probable that 
topics was of central importance to 
the students tested. If this had not 
been the case the students might have 
shown greater resistance to change 

The results obtained by Janis and King were replicated by Rosenberg
(19) under less artificial conditions. Using three standard
role-playing situations with three groups of 15 persons each, group
members were rotated among three experimental positions: role players,
identifiers (instructed to identify with the role players), and active
observers (instructed to watch the action as objectively as possible).
A special group of control observers was assigned simply to watch the
action. The role-playing situation involved such topics as
labor-management relations, guidance problems, etc. After each
role-playing session a questionnaire was given to all group members.
This was followed by a carefully observed nondirective group discussion
about the role playing. Using data so gathered it was found that the
role players were most susceptible to change. They were also found to
be the most productive in the discussion after the role playing, to
have certain emotional biases of judgment and perception, and a
tendency to support their own actions in the role playing. The
identifiers were found to be most critical, almost as productive as the
role players during the discussion, and to have an awareness of the
dynamics of the interaction process second only to the role player.
The active observer was found to be uninvolved and therefore
unemotional with reference to the enactment. He was as critical as the
identifier but relatively unaware of the dynamics of the interaction
process. Finally the control observer was found to be indifferent,
unproductive during the discussion, and relatively unaware of the
interaction dynamics. An interesting implication of this study is that
understanding of the dynamics of interaction processes is associated
with involvement in those processes. It is also interesting to note
that individual involvement was found to be a function of position
assignment.

While Rosenberg was concerned with the differences induced by
experimental position within the role-playing situation, she did not
consider differences between individuals in the same position. These
differences have been studied by Luszki (14) who compared observers who
were good at understanding the interaction dynamics in a role-playing
scene with those who were not. She found that good observers, as
distinguished from poor ones, (a) were good at determining how others
looked at themselves, (b) were good at determining how others looked at
them, (c) were well adjusted, (d) evaluated themselves as others
evaluated them, and (e) were consistently and favorably perceived by
others. Further it was found that persons who were good judges of how
others felt about them were able to identify most successfully with
role players. Persons who were good judges of how others felt about
themselves were found to be good at observing the actions of role
players critically.

Brown (8) has explored the effect of the role-playing situation on the
success of the role player's performance. He did this by asking his
subjects to reverse roles with each other and having them evaluate how
well the other person succeeded in portraying them. It was found that
success was inversely related to the realism of the role-playing
situation. The more unusual the situation in which the role was
portrayed, the better was the portrayal.

In summary, the findings in this area are sketchy and essentially
suggestive in nature. The evidence indicates that role playing may
produce behavioral and personality change, but this has certainly not
been established with any degree of confidence. The evidence also
suggests that the effect of role playing on the individual is related
to his personality and to his position in the role-playing situation.
In addition, the success of the role player may be related to the
setting of action in the role-playing situation.


SUMMARY 


Perhaps the most striking impression to be gained from a review of the
experimental studies of role playing is their scarcity. Except for the
few studies reported here, effort among practitioners of role playing
seems to have been devoted to the exploitation of its various
applications.

With reference to personality assessment, there is some sound evidence
for believing that reliable and valid role-playing tests can be
developed. Such tests are, however, still in their infancy, and it is
toward the development of these tests that future effort might
profitably be directed.

With reference to personality change, it can only be said that there
is as yet little supportive evidence. The studies which do exist
suggest the possibility of such change and indicate a few of the
relevant variables that may be involved. Much effort is needed in the
exploration of role playing as a method for producing change. If the
possibility of such change were established, a number of related topics
would become pertinent. For example: What personality characteristics
are most affected by role playing? Are all role-playing techniques
equally effective? Is role playing less effective in producing change
if it is preplanned? What is the effect of the audience on the role
players? These questions, and others which will readily occur to the
reader, are open for experimentation. But they all rest on the
assumption that role playing can produce personality change, and this
has not as yet been conclusively demonstrated.


Despite the somewhat pessimistically worded summary in our last review
of the W-B² (97), the avalanche of research studies with this test has
abated very little. The bibliography that accumulated attests to that.
Moreover, this fact is a testament to the vitality of the in[...]

A review by Watson (127) appeared in the interval since our last
review. Our attempt will, again, be to make an exhaustive and critical
examination of the studies in this area. Many of these would probably
not have been published had authors and editors heeded the
recommendations of our previous review. We have taken care to prepare a
complete bibliography so that taken with the earlier two reviews
(96, 97) the literature should be well covered.

The organization of the present review resembles closely that of the
previous one. The first half deals with the W-B as a test of general
intelligence, whereas the second half of the article is devoted to
studies with sundry classifications of deviant and disordered subjects.


THE WECHSLER-BELLEVUE AS A 
TEST OF GENERAL INTELLIGENCE

Studies of Reliability


Recent interest in practice effects and the dependability of subtest
scores has filled the previously observed lacunae of reliability data
on the W-B. Most of the currently reviewed studies present reliability
coefficients for the subtests as well as for the three IQ scales.

² The subtest abbreviations I, C, D, A, S, V, PA, PC, BD, OA, and DS
correspond to the following: Information, Comprehension, Digit Span,
Arithmetic, Similarities, Vocabulary, Picture Arrangement, Picture
Completion, Block Design, Object Assembly, and Digit Symbol. FS, V IQ,
and P IQ refer to Full Scale, Verbal IQ, and Performance IQ,
respectively.

Test-retest agreement. The two studies by Steisel (117, 118)
investigated the effect of practice on the W-B scores. Subjects were 17
matched pairs of college girls of superior intelligence. The first of
each pair was retested after 14 days while the retest interval for the
second of each pair was 77 days. Practice effects on all three scales
appeared for both groups. Mean increase in FS IQ was eight to nine
points. The P IQ increases were approximately double those for the
V IQ. Steisel (117) points out that ". . . in a retest situation the
examiner should place more trust in the verbal scores . . . ."
Differences in the retest scores after the short and long intervals
revealed that only the score on the A subtest was a function of the
recency of administration. The author concludes, "The findings in
general corroborate those of previous studies in indicating that the
significant gains in retest scores are maintained up to approximately
three months. . . ." A similar study, but with mental defectives, is
reported by Hays and Schneider (56). They employed different retest
periods of two, four, six, and eight weeks. The results also suggest
that the different retest intervals do not affect the IQ increases
differentially. The W-B Form I FS IQ showed an increase of 7.6 points
due to practice as compared with only 4.4 for the Form II.

Split-half reliability. When Webb and De Haan (129) reported higher
split-half reliability coefficients for psychotics than for normals,
they provoked much thought. Helmick (57) criticizes their failure to
equalize the range of talent for the two groups. Webb (128)
confidently replies that Helmick's suggestion of equalizing the ranges
of talent would create a hybrid sample which was unreal and could never
be representative of an actual population of people.

Perhaps less interesting but more important than these asides are the
main features of the study. Webb and De Haan (129) employed 100
subjects to provide reliability coefficients for the subtests. All the
subtest coefficients were statistically significantly different from
zero for the paranoid subjects, while only PA failed to show
significance with the normal half of the sample. The I subtest was the
most reliable for both samples. Only two subtests showed much
difference in reliability for the different subject sampling. These two
subtests, PA and PC, were more reliable for the psychotics. Botwinick
(14) utilized Webb and De Haan's normal group, a group of 50 older
normals, and a group of 31 patients with mental disorders of the
senium. Unfortunately sex could not be held constant for all three
groups. Corrected split-half coefficients were calculated, and only C
failed to differ significantly from zero. The author's findings were
characteristic for this type of study. The coefficients were large for
the older subjects and were still greater for the senile disordered
group. As the subtest variability increases due to aging or mental
changes associated with illness, the reliability coefficients seem to
increase in magnitude. The reliability of some subtests is not as high
as might be desired. Thus, it is not surprising that the authors
(14, 129) are skeptical of employing diagnostic pattern analyses that
are based upon the more unreliable of the subtests.

Summary. The additional research covered in the current section
provides sufficient information to guide the clinician. The studies
have been well designed and are fairly conclusive. The IQ scales seem
reliable, although subject to practice effects. The practice-effect
increment does not depend greatly upon the size of the test-retest
interval provided the retesting occurs within three months. Practice
effects are about twice as great on the performance section.
Split-half reliability coefficients for the various subtests show quite
a range of values. Some of the smaller coefficients cast doubt upon the
usefulness of diagnostic pattern analysis which is based upon the more
unreliable subtests. Additional work in the area of reliability should
be well designed and comprehensive.
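
A "corrected" split-half coefficient of the kind mentioned above is conventionally obtained by correlating two half-tests and stepping the result up to full length with the Spearman-Brown formula, r_full = 2r / (1 + r). The studies reviewed do not describe their computations, so the following Python sketch is only an illustration: the odd-even split, the function names, and the data layout are assumptions made here.

```python
from statistics import mean


def pearson_r(x, y):
    """Pearson correlation of two equal-length score lists.

    Raises ZeroDivisionError if either list has zero variance.
    """
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def spearman_brown(r_half):
    """Step a half-test correlation up to the full test length."""
    if r_half <= -1:
        raise ValueError("half-test correlation must be greater than -1")
    return 2 * r_half / (1 + r_half)


def split_half_reliability(item_scores):
    """Corrected odd-even split-half reliability.

    item_scores is a hypothetical layout: one list of item scores per
    subject, with items in their order of administration.
    """
    odd_half = [sum(row[0::2]) for row in item_scores]   # items 1, 3, 5, ...
    even_half = [sum(row[1::2]) for row in item_scores]  # items 2, 4, 6, ...
    return spearman_brown(pearson_r(odd_half, even_half))
```

Because the correction depends only on the half-test correlation, a group with greater score variability will tend to show a larger coefficient, which is the issue raised in the exchange over range of talent.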

Correlation With Other Tests

Other Wechsler scales. The W-B is compared with the Form II in a
comprehensive study by Gerboth (45). Retest results disclosed that when
Form II was administered first there was a significant increase in the
subsequent W-B scales scores. The scores were uniformly higher by about
five points for all scales. Surprisingly, Hays and Schneider (56) found
this same elevation of W-B scores when they preadministered the
Form II. This is surprising, because instead of normal superiors they
employed mental defectives. For some reason preadministering the Form I
does not elevate the subsequently obtained Form II scores nearly as
much. A satisfactory explanation of this special effect has not been
forthcoming. According to Gerboth's study (45), the intercorrelations
between the subtests suggest that the I, C, A, and PA subtests were
least comparable between the W-B and the Form II. The correlation
between FS IQ's is about .75. This is fairly high since the range of
talent was only half that for an unrestricted sample. Another Wechsler
scale, the Wechsler Intelligence Scale for Children (WISC), was
compared with the W-B by Delattre and Cole (31). With a good range of
intelligence they found a correlation of .87 between the FS IQ's. The
mean FS IQ was six points higher for the WISC. Intercorrelation of
corresponding subtests of the two tests provides indices of
comparability. PA was particularly low with an r of only .19 between
the two tests. The rather moderate correlations provoke the authors to
wisely invoke caution in applying W-B patterns and signs to WISC data.
An almost identical design with a narrower range of intelligence is
found in a more recent study by Knopf, Murfett, and Milstein (68).
Their results provide good confirmation of the earlier findings of
Delattre and Cole (31). They also emphasize the lack of comparability
between corresponding subtests of the two tests. Vanderhost, Sloan, and
Bensberg (124) report on the gross comparability of the W-B and the
WISC for a mental defective group. Their experimental design neatly
counterbalances extraneous factors. With a restricted range of IQ they
report a correlation of .72 between FS IQ's. They found the mean WISC
to be 1.6 IQ points higher than the W-B, a difference not statistically
significant.

Other intelligence tests. Alderdice and Butler (3) found a correlation
of .69 between the W-B FS and the Revised Stanford Binet (S-B), Form M.
However, the correlation is somewhat attenuated by the narrow range of
IQ's in the sample. Correlations between the W-B and the S-B are
considerably higher when an appropriate range of talent is employed
(96, 97). As encountered in the past, the W-B IQ was found to be
somewhat higher than that for the S-B (3). Duncan (37) presents a table
for converting W-B vocabulary scores to S-B vocabulary scores, thereby
eliminating the necessity of administering both vocabularies, when both
tests are included in a battery.

The Army General Classification Test (AGCT) was compared with the W-B
by Tamminen (121). A correlation of .91 was obtained between the two
tests when the sample was corrected for range of talent. The AGCT
appears to be more closely related to the V IQ than to the P IQ of the
W-B. The American Council on Education Test (ACE) correlated .61 with
the W-B FS with a restricted range of IQ's in the sampling by Merrill
and Heathers (86). Gerboth (45) reports a comparable correlation
between the ACE and the W-B but also employed a narrow range of talent.
A thesis by Smith (109) reports the relationship between the SRA
Primary Mental Abilities Test and the W-B but was not available for
review.

Knott et al. (69) compared the W-B with the seven-test Kent Battery.
The correlation with the FS IQ was .87, but this is difficult to
interpret since the range of talent of the sample was not reported.
Delp (32) compared only the EGY of the Kent Battery with the W-B. The
correlation was .65, and the range of talent was only slightly reduced.
Raven's Progressive Matrices correlated .55 with a short form of the
W-B according to Levine and Iscoe (71). They state, "The Progressive
Matrices appeared to tap areas of intelligence most closely related to
BD and not significantly related to C." In another study Levine and
Iscoe (72) found the correlation between the W-B and the Matrices
". . . not high enough for individual prediction . . . ," but this was
with a deaf sample. Desai (33) compared the Matrices against just the
[illegible] IQ of the W-B. He obtained an r of .65 which was corrected
for attenuation. Thornton, Allen, and Stenver found the surprisingly
high correlation of .86 between the Ammons Picture Vocabulary and the
W-B. Rubin (101) presents the results of his comparison of the W-B and
quantitative features of the H-T-P. With an appropriate range of talent
he obtained a correlation of .67. The mean IQ's were rather comparable.
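
A correlation "corrected for attenuation" is an observed correlation divided by the square root of the product of the two tests' reliabilities, which estimates the relationship between error-free scores. The sketch below shows that arithmetic in Python. The function name and the figures in the usage note are hypothetical, chosen only for illustration, and are not the values from any study cited here.

```python
import math


def correct_for_attenuation(r_xy, r_xx, r_yy):
    """Estimate the correlation between true scores on tests x and y.

    r_xy is the observed correlation; r_xx and r_yy are the reliability
    coefficients of the two tests (each must lie in the interval (0, 1]).
    """
    if not (0 < r_xx <= 1 and 0 < r_yy <= 1):
        raise ValueError("reliabilities must be greater than 0 and at most 1")
    return r_xy / math.sqrt(r_xx * r_yy)
```

For example, with a hypothetical observed correlation of .60 and reliabilities of .90 and .80, `correct_for_attenuation(0.60, 0.90, 0.80)` gives a value of about .71. A reader comparing coefficients across studies therefore needs to know whether each one is the raw or the corrected figure.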

Other measures. Storrs (119) reports interesting relationships between
the W-B scales and the General Aptitude Test Battery (GATB). The
Battery showed the interest, verbal, and numerical factors to be
related to the V IQ. The test-factors related to P IQ were spatial,
form perception, clerical perception, aiming, and motor speed. A
correlation of .58 was found for the relations between the total score
for the Battery and the W-B FS. The correlation with the P section was
slightly higher than with the FS.

Interest in the correspondence between W-B and intellectual aspects of
Rorschach performance is encountered in the work of Abrams (1). A
multiple-regression equation is presented which predicts IQ score. The
Rorschach elements involved are F+%, M, W, and R. The multiple R is
.53.

Only two studies compared W-B scores with criteria other than tests.
Gerboth (45) found that both forms of the W-B were significantly
related to grade-point average in school. The correlation for Form I
was .29, while that for Form II was .26. These r's are based upon a
narrow range of talent. Although Merrill and Heathers (86) also sampled
intelligence narrowly they obtained a correlation of .58 between W-B
V IQ and grade-point average.

Summary. As time passes the W-B and its later forms seem to attain an
increasingly venerable position. It is now the well-accepted standard
of intelligence evaluation for adults. As many tests of varied
composition are correlated with the W-B it becomes clear that the W-B
occupies a central position in evaluating the factor of general
intelligence. When corrections are made for the ever-present restricted
ranges of talent, the correlations between the W-B and other tests
argue well for both the reliability and validity of the test.

The principles of test construction employed with the W-B have been so
successful that an alternate form (132) and a children's scale (133)
have been developed. These two forms of the test seem to be comparable
to the W-B in their capacity to evaluate general intelligence. However,
the extension of W-B-derived subtest pattern interpretations to these
tests has not been too well supported.

The authors once again bemoan the failure of investigators to report
the range of intelligence of the samples. We are especially distressed
that so few of the writers correct their coefficients for
range-of-talent. Too often the raw correlation is meaningless as
presented in the report. Too often the tests in the file determine the
study rather than the nature of the study determining the data to be
collected. The sampling of one study (31) is exemplary, while still
others show much thought behind the manipulation of experimental
variables and the statistical analyses (3, 31, 45, 56).
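
The review does not say which correction for range of talent it has in mind. One widely used choice, assumed here purely for illustration, is the correction for direct restriction of range on one variable (often called Thorndike's Case 2): r_c = r·u / sqrt(1 − r² + r²·u²), where u is the ratio of the unrestricted to the restricted standard deviation. A minimal Python sketch follows, with a hypothetical function name.

```python
import math


def correct_for_range_restriction(r_restricted, sd_ratio):
    """Estimate the correlation in an unrestricted population.

    r_restricted: correlation observed in the range-restricted sample.
    sd_ratio: unrestricted SD divided by restricted SD on the selection
        variable (greater than 1 when the sample's range is curtailed).
    Assumes direct selection on one variable and linear, homoscedastic
    regression; these are assumptions of the formula, not of the review.
    """
    if sd_ratio <= 0:
        raise ValueError("sd_ratio must be positive")
    numerator = r_restricted * sd_ratio
    denominator = math.sqrt(1 - r_restricted ** 2 + numerator ** 2)
    return numerator / denominator
```

When `sd_ratio` is 1 the function returns the observed correlation unchanged, and the estimate rises as the sample's range narrows relative to the population. This is why a raw coefficient reported without the sample's range of IQ is difficult to interpret.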


Short Forms 


Surprisingly, only three papers (3, 29, 53) propose new short-form
combinations of W-B subtests. First of these is Gurvitz's (53) thorough
sifting of all possible two-subtest combinations. With a large sample
the I-DS proved best and was confirmed upon cross validation. The
second of these proposals is the practical one by Cotzin and Gallagher
(29). They show that the four-test form C-S-PA-BD correlates .94 with
the FS for a cross-validation group of defectives. Both the V IQ and
P IQ can be estimated from this form, which is felt to be of particular
importance when working with mental defectives. Their regression
equations provide appropriate raw scale conversions to IQ. Last is the
proposal by Alderdice and Butler (3), who developed the I-D-S-OA
combination so that S-B scores could be best predicted for mental
defectives.

Previously proposed short forms are evaluated in an additional three
papers (58, 59, 75). McKenzie (75) studied the V-I-S-BD form proposed
by Kriegman and Hansen. He concludes this form is a good means of
estimating the intelligence of mental defectives and that it provides
more diagnostic information than similar instruments. In the same year
both Herring (58) and Hilden (59) reported thorough empirical
investigations of previously proposed forms. Herring studied the short
forms with both mental-hygiene-clinic patients and normals while Hilden
used only mental-hygiene-clinic patients. The more extensive study by
Herring (58) indicates the best two, three, four, and five subtest
combinations.

Summary. This review covers only one third as many studies dealing
with short forms as appeared in the previous review (97). Instead of a
deluge of brief unrelated studies it is gratifying to encounter
well-planned comprehensive studies (3, 53, 58, 59). Two of the reports
(58, 59) involved considerable effort directed at bringing about an
integration of work in the area of short forms. One of the proposed
short forms (29) is the first to yield an estimate of both V IQ and
P IQ for use with defectives. Sampling problems such as range of talent
and cross validation have also received more consideration than
formerly. This greatly increased sophistication is most encouraging.


Applications With Special Populations


Sex differences. A comprehensive 
study by Norman (89) explores sex 
differences in young superiors. Many 
significant differences in subtests as 
well as for the V and P IQ's were en-
countered. In general, men have
higher scores than women, and the 
greatest difference is apparent on A. 
This relatively large sex discrepancy 
in A was encountered in college stu- 
dents by Guertin (51) and proved 
puzzling. Vane and Eisen (125) found 
that both delinquent and normal girls 
had lower subscores on A and D than 
the standardization population of 
Wechsler. Strange and Palmer (120) 
employed psychiatric clinic patients
for their study of sex differences. 
Their results agree with Norman's 
positive findings as do the more re- 
cent conclusions of Goolishian and 
Foster (48) using neuropsychiatric 
patients. Brown and Bryan (18) dif- 
fer in that their normals do not show 
the superiority of the males so clearly, 
and patterning is somewhat different 
than found by other authors. 

Miscellaneous populations. Geriatric changes in W-B performance were
studied by Birren (13) in an unusual way. Instead of merely evaluating
subtest scores he studied the factor scores of geriatric subjects. His
Verbal Comprehension Factor held up well with advancing age, while his
Closure Factor (performance) showed sizable decrements. As a final
comment Birren states, "It is apparent that the W-B does not include
tests of many of the known primary abilities. For this reason, it is of
limited use in attempting to describe age changes in the intellect."

Colored and white neurotics were matched for the usual variables by
Davidson et al. (30). The A subtest and performance scores were lower
for the colored. Goldstein (47) discusses problems encountered in using
the W-B in South Africa. A great many changes had to be made to adapt
it to this population. Maleci and Montanari (79) present a similar
report on the adaptation of the test for use in Italy. A report from
Calcutta University (21) cites the same sort of problems in adapting
the test for India. V and PA required the most alteration.
Aeppli-Tanner (2) found she could employ a German translation with
Swiss adolescents "without extensive alterations." Gross quantitative
results compared well with American norms; however, subtest
discrepancies were present.

While not too much has been published about the use of the W-B in
vocational counseling, a few studies appeared during this review
interval. Of particular importance is Patterson's (93) guide for
counselors. Mean W-B scores are presented for each of 16 broad
occupational groups by Simon and Levitt (108). Another study by Ladd
(70) discloses higher V IQ's for students in academic
teacher-preparation areas as compared with nonacademic
teacher-preparation areas. Those in the nonacademic areas showed higher
P IQ's than those in the academic areas. The authors cautiously suggest
there might be some implications for counseling. Merrill and Heathers
(84) present a normative study setting forth W-B centile scores for a
college counseling-center group and also for college student
volunteers. They found little difference in the IQ's for the two
samples. They concluded that those with personal difficulties were not
likely to be significantly lower on the W-B.

Summary. Sex differences with normals and psychiatric subjects seem to
be rather well established. A study employing a representative sample
of the general population to identify these sex differences appears
much needed at this time. It would be of interest to see whether sex
differences appeared in the standardization results of the new Wechsler
Adult Intelligence Scale. Perhaps differences in performance by the
sexes at one level of intelligence are reversed in direction at a
different level so that the total mean differences in test performance
are minimal. Differences in A, D, and PC are repeatedly encountered
(48, 89, 120). Researchers and clinicians should take heed of this and
specify sex as well as age when proposing patterns or applying pattern
analyses.

Some of the more familiar populations of past research have received
minimal attention recently, while some new interests appear
(2, 47, 49, 70, 79, 84). It seems appropriate that as the problems of
the past are resolved, new interests should appear. Accompanying this
progress is the extension of the W-B to foreign populations.


Refinements and Critiques


Administration and scoring. Kitzinger and Blumberg (66) present a
supplementary guide for administering and scoring the W-B. It includes
scoring principles and examples as well as good descriptions of
conventional subtest rationale. Cohen (23) was interested in scoring
and administration biases. He studied samples of W-B's from 13
examiners and was able to identify evidence of examiner biases that
would serve to reduce reliability. Those subtests that show lowest
reliability were those that produced the most systematic interexaminer
disagreement. This raises the possibility that much of the variability
of subtests that is ascribed to unreliability may be systematic
variance related to the nature of the examiner.

Item order of the various subtests again receives some consideration by
Norman (89). An incidental finding of his larger study was that the
order of difficulty for superiors agreed better with the order found by
Jastak than with Wechsler's (131). Mech (83) presents the order of
difficulty for high school students. He suggests that the item order be
changed and then appends the statistically wise but impractical
condition that this ". . . applies only to students in the 12th grade."
Mech is faced with the enigma of establishing an order of difficulty
that will apply for any group of subjects. It would seem necessary to
revert to Wechsler's intent of establishing the item order on a
representative population. Different orders of difficulty found with
small restricted portions of the whole population would not then be a
basis for challenging the originally proposed order. Russell (102)
proposes a revised sequence of V words for use with neuropsychiatric
patients.

Other subtests have been studied. For example, Luchins and Luchins
(74) examined the effect of varying the instructions on the DS subtest.
This well-designed study permits the following conclusions: (a) initial
test score cannot serve as a reliable index of learning, (b) speed
emphasis may retard performance, and (c) Wechsler's directions are so
ambiguous that they produce greater variability among the subjects as
compared with the experimental directions. Guertin (51) was interested
in a different subtest. He studied the effect of instructions and item
order upon A. Contrary to the investigator's expectancy, the superior
subjects of the study were not threatened by encountering difficult
items early in the presentation. In fact, they accepted the challenge
and performed better than with conventional order of presentation. It
seems possible that mentally subnormal subjects might show just the
opposite reaction, but this has yet to be verified.

Eglash (38) studied the relationship between the FS score and
the shoes item on C. He finds some justification for relating
the number of reasons given to intelligence. However, the
spread between one and two reasons is such that it would be a
more appropriate cutting point than the one in current use.
Following up on this suggestion, Armstrong (8) rescored the
shoe item, and the r between C and the FS rose from .42 to .48.
The OA subtest was the subject of the article by Shannon and
Rossi (106). They suggest a presentation method for OA though
it was completely described several years earlier by Derner and
Aborn and was covered in our last review.

Statistical analysis. An item analysis with high school
students was made by Mech (83). Item order and discriminative
values are presented for each item. Stanley (115) discusses
what he feels “every good clinician should know.” The article
deals with what to tell the psychiatrist when the V IQ and P IQ
are both larger or smaller than the FS IQ. A rather
mathematical description of the fallacy of averaging averages
points out that averaging the V IQ and P IQ yields a
meaningless “average” IQ that could not be expected to
correspond to the FS IQ. Cohen (24, 26) cleverly
factor-analyzed the W-B subtests using three types of
psychiatric patients. He makes a methodological contribution
by demonstrating that the factors underlying success on a test
are a function of the sample of people taking the test. The
material is presented in detail in a relatively nontechnical
fashion in another paper (25), and it is highly recommended to
clinicians. Cohen identified the three factors: Verbal,
Nonverbal, and Distractibility. Similar factors were derived
by Birren (13) for a group of elderly subjects. His factor
names were respectively: Verbal Comprehension, Closure, and
Rote Memory. In addition a fourth factor is tentatively
identified as Induction. Alderdice and Butler (3)
factor-analyzed the performance of mental defectives but found
only a general and a performance factor. Whiteman and Whiteman
(137) report on the application of Jastak’s predetermined
clusters: Reality perception, Psychomotor efficiency, Language
polarity, and Motivation. Wheeler (136) also reports the
results of a factor analysis of the W-B and other instruments.
Summarizing, there seems to be a fairly good overlap of factors
with a few factors appearing only with certain subjects. Most
frequently encountered are a verbal, a performance, and an
attention factor.
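
Stanley’s point about averaging the V IQ and P IQ, noted in the paragraph above, can be illustrated with a short sketch. It is an idealized normal-theory illustration and not the W-B scoring tables: it assumes IQs scaled to a mean of 100 and an SD of 15, treats the full-scale score as a rescaled sum of two equally weighted standard scores, and uses a hypothetical V-P correlation of .70. The function name and all numbers are invented for illustration.

```python
import math


def composite_iq(v_iq, p_iq, r, mean=100.0, sd=15.0):
    """Rescaled composite of two equally weighted IQs (idealized model).

    r is an assumed correlation between the two scales; it is a
    hypothetical parameter, not a value reported in the review.
    """
    z_v = (v_iq - mean) / sd
    z_p = (p_iq - mean) / sd
    # The sum of two standard scores correlated r has variance 2 + 2r,
    # so the sum must be divided by sqrt(2 + 2r) to be a standard score.
    z_full = (z_v + z_p) / math.sqrt(2.0 + 2.0 * r)
    return mean + sd * z_full


simple_average = (120 + 120) / 2
full_scale = composite_iq(120, 120, r=0.70)
print(simple_average, round(full_scale, 1))
```

Under these assumptions the composite lies farther from 100 than the simple average whenever both part scores fall on the same side of the mean and r is below 1.0, which is the situation the article addresses; with r equal to 1.0 the two figures coincide.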

Miscellaneous. Gurvitz (54) reached his limit of professional
tolerance and wrote a critical article on the defects in
standardization sampling and procedure for the W-B. The
criticisms are too many to list and reading of the original
article is essential for anyone planning to employ data from
the standardization. He also scolds Wechsler for his
carelessness in changing subtest instructions between editions
of the manual yet not presenting any restandardization.
Gurvitz makes a most reasonable plea for more adequate
standardization sampling in future work.

The relationship between learning ability and W-B performance
was investigated by McLean (76). By studying neuropsychiatric
patients with both the W-B and conventional learning tasks, the
author was able to reach several conclusions: (a) higher V IQ
is indicative of ability to learn in verbal situations, (b)
higher P IQ is indicative of ability to learn in performance
situations, and (c) subtest variation is inversely related to
learning proficiency. The touch of the learning psychologist
is also encountered in the experiment by Burik (19). He found
that motor test scores, such as rate-of-tapping, were most
closely related to DS. There was also good evidence that the
incidental learning of the symbols was not related to the DS
score. He concludes that the DS subtest should be regarded
more as a test of visuomotor coordination than as a test of new
learning.

Hypnosis was used by Kline (67) to demonstrate the W-B in
action. He hypnotically regressed and progressed a 22-year-old
woman. W-B’s were obtained in these two states as well as at
normal age and in a waking state. Under the varying conditions
the IQ remained surprisingly constant although weighted scores
were necessarily diminished for both the CA’s of eight and 65
years. The changes in subtests with progressed age conformed
remarkably well to clinical expectancy. The DQ also is
appropriate for that age group. Even item six on BD, which is
not in the correct order of difficulty, was failed when the
subject “believed” she was 65. It would be expected that the
“conforming” subject would select for failing the last and more
complex-appearing design which employs all the blocks. Not
much is proved, but this study makes interesting reading. A
thesis by Ficca (39) relating autonomic features to W-B
performance sounded interesting but was not seen.

A series of studies by Stacey and others (113, 114, 115)
investigated Gerstein’s hypothesis about conceptual
performance. Gerstein designates the descriptive definition as
lowest in intellectual requirements, while the functional is
intermediate; and the conceptual definition is most associated
with high intelligence. Subnormals failed to confirm the
predictions of frequency of occurrence for the three types of
definitions for V (114). A similar result was obtained when
concentrating on S answers (113). In the most recent study,
Stacey and Spanier (115) switched to superiors for a study of V
responses. The functional was actually associated with lower
intelligence than was the descriptive-type definition. This
finding was in line with the earlier two studies (113, 114) and
suggests that the descriptive-type definition is of a slightly
higher level than the functional definition.

Summary. There seems to be a welcome increase in
comprehensive, theoretical articles this time (8, 13, 23, 54,
76, 83, 89). The sophistication of researchers seems to be
advancing. Some evidence of this is found in the relatively
large number of factor-analytically designed studies in this
review (3, 13, 24, 26, 136). Also new is the learning theory
design focused upon test theory rationale (19, 76). In
addition to the more comprehensive studies, there is an
ever-growing body of knowledge about the rationale of the
individual subtests.


THE WECHSLER-BELLEVUE AS A
DIAGNOSTIC AID


The use of the W-B as a diagnostic aid implies an assumption
most recently expressed by Jastak, “... intelligence is not a
global trait but a general and pervasive part function of the
personality.” Persistent criticism of such an assumption is
expressed most recently in various research and by Schofield
(105).


As a Measure of Emotional Factors 


Brower in 1947 proposed that a 
negative correlation existed between 
the Hy, Hs, and Pd scales of the Min- 
nesota Multiphasic Personality In- 
ventory and the W-B IQ, the prob- 
able rationale being that neurotics 
and persons with character disorders 
would show intellectual impairment. 
Winfield (138) checked this hypothe- 
sis and found no support for it. 

Other interest in the relationship between personality tests
and intellectual performance is confined to the Rorschach test.
Probably most comprehensive is the study by Holzberg and
Belmont (61). They hypothesized a number of relationships
between Rorschach signs and W-B test performance, but only four
of their many predictions were substantiated. They set forth
their empirical findings under eight features of Rorschach
performance with counterparts in W-B performance. A single
hypothesis was tested by Burnham (20). He failed to find
support for the hypothesis that there is a relationship between
M and PA. However, the thesis by Spaner (111) touched upon
related material. He was able to demonstrate a relationship
between W’s on the Rorschach and PA score. Five of the ten
hypothesized relationships between Rorschach and W-B
performances were confirmed. Another thesis (122), not seen,
also deals with the relationship between Rorschach features and
W-B performance.

Summary. Attempts to validate the general assumption that
something other than a mere IQ may be extracted from
performance on the W-B have been disappointing. However, as in
the case of the comparison between the Rorschach and the W-B,
it might be reiterated that (a) the Rorschach scores, as
statistically valid measures of personality, have not been
established, and (b) the rationales for the subtests as
proposed by Wechsler have been challenged.


As a Measure of Change Following 
Therapy 

Markwell, Wheeler, and Kitzinger (81) recorded the W-B
performance of schizophrenics before and after prefrontal
lobotomy; and Smykal and Wilson (110) did the same with
electroconvulsive therapy. Markwell, Wheeler, and Kitzinger
found no statistically significant differences between pre- and
postoperative subtest performance; Smykal and Wilson found some
gross group differentiation but considerable individual
differences to attenuate the positive aspect of their findings.

In the Smykal study, the W-B was administered before treatment
and after the fifth and tenth administration of shock (shock
being given twice a week for five weeks). Interesting results
demonstrated (a) the percentage of IQ’s of 35 or below
decreased 50 per cent after the fifth shock; the Markwell group
found that lobotomy increased the testability of the patients
by 20 per cent, (b) the highest point of efficiency was at the
midpoint of treatment, after the fifth shock; and (c) the
preshock pattern corresponded to Rapaport’s chronic
schizophrenic group, while the post-fifth shock correlated
highly with his acute group.

Summary. Research failed to demonstrate differences in subtest
performance reliably following either prefrontal lobotomy or
ECT. However, the generalization of the two researches
reviewed in this section must be delimited by the restricted
size and nature of their sample populations. What the research
does attest to is the fact that, as the result of therapy, the
patient does become more amenable to testing.


Mental Deterioration Index 

Validity. Tests of the validity of Wechsler’s (131) Mental
Deterioration Index (MDI) have produced generally negative
results even though Howell (62) found that, “... it may be
tentatively concluded that Wechsler’s assumption of
deterioration at all levels is correct.” Corsini and Fassett
(28) culled the test protocols for 100 inmates of San Quentin
prison. In this cross-sectional study, the total population
was divided into twelve five-year periods, starting with ages
15-19 and going to 70-74. These investigators found that “by
and large the verbal test scores increase with age while
performance scores decline,” suggesting a V-P ratio rather than
the “hold,” “don’t hold” ratio, as the more sensitive measure
of mental deterioration. These findings are in good agreement
with those reported by Birren (13).

Bensberg and Sloan (10) tested Wechsler’s MDI on mental
defectives. They culled their protocols from the files of the
Lincoln State School and Colony, excluding those individuals
who had not received S-B IQ’s of 42 or better, and who showed
evidence of psychotic or organic processes. The results with
the W-B were compared with those on the Arthur Point Scale and
the S-B. The results suggested (a) although the Arthur and the
S-B demonstrated a negatively accelerating function with regard
to CA increments, the W-B scores tended to increase, rather
than decrease, and (b) the W-B scores did not differentiate the
older group (30-55) from their earlier MDI scores obtained
between the ages of 15 and 24. The authors conclude that,
“... the ‘normal deterioration’ which Wechsler found for older
subjects may have been an artifact due to faulty sampling, at
least for the lower intelligence groups.”
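
The two indices contrasted above can be sketched in a few lines. The review does not give the formula or the subtest groupings, so everything here is an assumption made for illustration: the index is taken in the commonly cited form (hold minus don’t hold, divided by hold), the subtest groupings are hypothetical, and the weighted scores are invented.

```python
# Hypothetical groupings; the review itself does not list them.
HOLD = ("information", "comprehension", "object_assembly", "picture_completion")
DONT_HOLD = ("digit_span", "arithmetic", "digit_symbol", "block_design")


def hold_dont_hold_index(weighted):
    """Proportional loss of the don't-hold sum relative to the hold sum.

    Assumed form: (hold - don't hold) / hold. Positive values mean the
    don't-hold subtests are lower than the hold subtests.
    """
    hold = sum(weighted[name] for name in HOLD)
    dont_hold = sum(weighted[name] for name in DONT_HOLD)
    return (hold - dont_hold) / hold


def verbal_performance_ratio(verbal_sum, performance_sum):
    """Simple V/P ratio of summed weighted scores (illustrative only)."""
    return verbal_sum / performance_sum


# Invented weighted scores for one hypothetical protocol.
example = {name: 10 for name in HOLD}
example.update({name: 8 for name in DONT_HOLD})

print(hold_dont_hold_index(example))
print(verbal_performance_ratio(50, 40))
```

The sketch only shows how the two measures are formed from the same protocol; it takes no position on which is the more sensitive.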

The study by Fox and Birren (41), however, suggests that when
psychotic or neurological dysfunctioning is excluded, a group
of sixty-year-olds is significantly different in subtest
patterning from a group of 69-year-olds. Fox and Birren find
that only two of Wechsler’s “hold” group
“hold,” viz., V and ¢. 

Glik (46) offers a very interesting thesis to explain the
inability of measures of deterioration, such as Wechsler’s MDI,
to reflect intellectual deterioration adequately. He assumes
that the manner of measuring present functioning is in error.
Glik reasons that the ability to recognize the meaning of items
one once knew is a more sensitive measure of deterioration than
measuring what items the subject can actually recall in the
present.

Testing this assumption on the V and I subtests, Glik found a
significant t between recall and recognition on I items only.
Glik generalizes his results to verbal questions without any
really adequate substantiation of this point, although his
proposals surely merit further study. At any rate, Glik’s
results do suggest that a discrepancy score between recall and
recognition may be more meaningful than the method of recall
now used with V and I items.

In senile psychosis. The major question to be asked here is:
does the addition of a psychotic process superimposed upon the
“normal” senescent decline significantly alter the W-B
performance?

Berkowitz (11), Botwinick and Birren (15), and Doerken and
Greenbloom (36) tested the efficiency of the W-B to
differentiate “normal” and “abnormal” seniles and found
positive results, while Botwinick and Birren (16) found that
the MDI does not differentiate a control group of seniles from
an experimental group of seniles with psychosis. The
investigators of the latter study as well as Birren (13)
emphasize that mental illness in these aged caused a greater
deficit in P IQ than V IQ although there was good evidence of
inroads on Verbal performance also.

In organic brain disease. Anderson (6) and Ptacek and Young
(94) conducted research in attempts to determine how effective
the MDI is in identifying mental deterioration in organics.
The results point to a rejection of the sensitivity of the MDI.

Anderson (6) addressed himself to the laterality of
localization of brain damage and its effect upon the MDI, with
the results being negative; although the subjects were
identified neurologically, they were not identified by the MDI.
Wheeler and Wilkins (135) tested the Hewson ratio, an empirical
formula devised to differentiate organics from nonorganics, but
found it to be lacking in ability to make individual
predictions.

In neurosis. O’Connor (90) tested the effect of neurosis upon
the MDI. The results indicate that the “hold” versus “don’t
hold” ratio is inapplicable to neurosis.

Summary. Research attempting empirically to define the utility
of the MDI has yielded negative results. The MDI has proven to
be insensitive to deterioration in senility, with or without
psychosis, organic brain disease, and neurosis. The research
findings did suggest that the present “hold”-“don’t hold” ratio
was not sensitive to changes due to a deteriorative process,
but that the verbal-performance dichotomy was.

Regardless of the nature of the findings per se, as we look
back over the various investigations conducted in this
particular area with the W-B, one is struck by the number of
important methodological criticisms that may be leveled against
the experimentation. In the first place, two validity studies
use highly restricted sample populations. Both these
investigations rejected the utility of the MDI with their
populations, but it is hard to generalize as to the validity of
the MDI with other populations. It is most difficult to
presuppose an index of deterioration that would cover the full
range of intelligence and age. Probably Wechsler was too
ambitious. It is still more difficult to make cross
comparisons of studies with such wide ranges of variables which
are known to relate to subtest patterning, viz., IQ, CA,
education, and vocational skills. Such variables only serve to
confound the data and confuse the research worker.

Even more difficult to interpret are results from a single
sample composed of both psychotics and nonpsychotics,
organicity due to trauma, and organicity due to endogenous
causes, and even varying degrees of organicity and central
involvement (94). Anderson, in another study attempting to
define the effect of localization of brain damage, had to say
about his own sample, “... non-dominant hemisphere sub-group
... was significantly older than the dominant hemisphere
group,” and “... although brain damage was established beyond
any reasonable doubt ... the criterion of unilaterality is
probably relatively poor.”

It appears that the conclusions drawn from the results of the
investigations attempting to define the effectiveness of the
MDI must be tentative because of restricted or questionable
sampling. These have apparently introduced distortion into the
subtest performance, confounding the results. Yet there is
uniform agreement that the MDI was not effective with those
samples employed.


Scatter and Pattern Analysis

Considerable research has been conducted on patterns of
performance on the subtests, and on scatter.
Since the last review, emphasis [...] placed on empirically
derived [...] into this intriguing [...]. General findings
[...] pattern [...] rather [...] transformation [...] alleviate
[...] test itself. [...] individual [...] Rapaport’s
diagnostic groups [...] Monroe (88), and [...] Holzberg (139)
all [...] adjustment was [...] amount of [...] criterion [...]
a direct [...] in IQ [...] Pearson r of .31 (.01 level) between
anxiety (Hypochondriasis, Depression, and Psychasthenia Scales
on the MMPI) and scatter on the subtests, and Moldawsky and
Moldawsky (87) found that D is an indicator of anxiety.
However, [...] Matarazzo found [...] relationship between
anxiety [...] performance on the subtests. Kaldegg (65), Love
(73), and Wittenborn and Holzberg (139) found no significant
relationship between variability of subtest performance and
pathology; and Rakusin (98) found “... a lack of uniformity in
normal scatter patterns on vocabulary scatter.” Rakusin,
incidentally, found a significant difference on the patterns of
subtests for a clinic (maladjusted) and a non-clinic (adjusted)
group, but also discovered that these apparent differences
disappeared when the effects of age and IQ on total scatter
were alleviated by the method of multiple covariance. Rakusin
concludes (98), “The differences obtained could not be
attributed to maladjustment.”
Merrill and Heathers (85) conclude on the basis of their
research [...] that “scatter [...] above [...] is to be [...]
average [...].” Jastak found [...] schizophrenics, and [...]
chi-square [...] differentiated by Wechsler’s signs, and
Wittenborn and Holzberg (139), on the basis of the [...]
performance of paranoid schizophrenics, manic-depressives,
alcoholic psychotics, and psychopathic personalities, found
[...] individual group [...] distinguishable.
Holzberg, Alessi, and Talkoff (60) tested the ability of seven
judges to predict the premorbid intelligence of psychotic
patients at Connecticut State Hospital. In general, the judges
made their evaluations on the basis of the amount of intertest
scatter in the individual protocols. Intercorrelations between
the judges’ estimates ranged from .15 to .87. However, such a
matrix of correlations tells little about the over-all
intercorrelation of the judges, i.e., how well they were doing
as a group. Hence, one of the present writers (GF) simply
reproduced the complete correlation matrix from that presented
in the article, converted the rho’s into ranks, and computed a
W, a coefficient of concordance, a nonparametric multiple-rank
correlation technique, which yields an over-all coefficient of
correlation. W was .27 (not significant), hence indicating
that there was poor agreement between the judges. The poor
results of the Connecticut research may be a function of the
inexperience of the judges since four of the seven were
interns, the other three being staff members.
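
A coefficient of concordance of the kind mentioned above can be sketched as follows. This is a minimal illustration of Kendall’s W for untied ranks and does not reproduce the writers’ computation, which started from the published matrix of rho’s; the function name and the rankings are invented for illustration.

```python
def kendalls_w(rankings):
    """Kendall's coefficient of concordance (no ties).

    rankings: one list per judge, each giving ranks 1..n for the same
    n cases. Returns 1.0 for perfect agreement and values near 0 when
    the judges' rankings cancel each other out.
    """
    m = len(rankings)          # number of judges
    n = len(rankings[0])       # number of cases ranked
    rank_sums = [sum(judge[i] for judge in rankings) for i in range(n)]
    mean_sum = m * (n + 1) / 2
    # S is the sum of squared deviations of the rank sums from their mean.
    s = sum((total - mean_sum) ** 2 for total in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))


# Hypothetical rankings of four protocols by three judges.
agree = [[1, 2, 3, 4], [1, 2, 3, 4], [1, 2, 3, 4]]
mixed = [[1, 2, 3, 4], [2, 1, 4, 3], [4, 3, 2, 1]]

print(kendalls_w(agree))
print(round(kendalls_w(mixed), 3))
```

A single W summarizes the whole group of judges, which is the property the writers wanted and which a range of pairwise correlations does not supply.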

Methodology. Before we discuss the research on pattern and
scatter analysis with regard to the various disease entities,
the writers deem it necessary to preface the discussion.
Antecedent to an understanding of the effect of psychopathology
upon W-B performance seems to be the need to recognize that
factors other than psychopathology affect subtest performance,
and must be taken into account in such analysis. For instance,
Cohen (23) found an examiner bias; Collins (27) found variation
in subtest as a function of age; many (18, 48, 51, 89, 109,
120, 125) found significant deviations on subtest scores due
mainly to sex; Aronov (9) and O’Connor (90) testify to the
effect of education on scatter; and Aronov (9), French and Hunt
(44), Merrill and Heathers (85), Norman (89), O’Connor (90),
Ortar (92), and Schnadt (104) validate the effect of IQ level
per se on scatter. The reader will understand why knowledge of
these findings is propaedeutic to an adequate purview of the
research on pattern analysis with the W-B. In attempts to
control for the effects of these variables, the research worker
has relied upon the statistical concept of randomization.
Statistically the assumption is maintained that if you
randomize the effects of these uncontrollable, hence
unmeasurable, parameters equally throughout the cells of one’s
table, the effects of the randomization will be to equalize the
effect of these parameters upon the variance. Such statistical
reasoning has been translated into research methodology in the
form of the matched group design. However, certain research
suggests that the simple randomized design is not the answer to
a research worker’s prayers.

Frank, Corrie, and Fogel (43) and Reich (99) have demonstrated
that we are no longer meaningfully speaking about “neurotics”
or “schizophrenics” when we include wide ranges of attributes
(age, education, IQ) in one group. For instance, Reich (99)
demonstrated that when he compared the subtest performance of
his group of schizophrenics from Kings County Hospital with
data from other groups of subjects, similarly diagnosed,
presented in previous published literature, the rank-order
correlations ranged from .84 to .19. Frank, Corrie, and Fogel
(43) performed an analysis of variance on the subtest
performance of like-diagnosed cases presented in the literature
which included wide ranges of IQ, education, age, etc., and
found them to be statistically different. If one begins by
comparing incomparables, the results are bound to be spurious.
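
The rank-order comparison of subtest profiles described above can be sketched with the usual untied-ranks formula for Spearman’s rho. The review does not say how Reich computed his coefficients, so the formula is an assumption, and the two profiles below are invented rankings of five subtests rather than data from any of the studies.

```python
def spearman_rho(ranks_a, ranks_b):
    """Spearman rank-order correlation for two untied rankings.

    Uses rho = 1 - 6 * sum(d^2) / (n * (n^2 - 1)), where d is the
    difference between the two ranks given to the same subtest.
    """
    n = len(ranks_a)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1.0 - 6.0 * d_squared / (n * (n ** 2 - 1))


# Hypothetical rank order of five subtest means in two samples
# carrying the same diagnostic label.
sample_one = [1, 2, 3, 4, 5]
sample_two = [2, 1, 3, 5, 4]

print(spearman_rho(sample_one, sample_two))
```

A coefficient near 1.0 would mean the two samples order the subtests alike; the spread from .84 to .19 reported by Reich indicates that like-diagnosed samples often do not.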

Subtest distortion is not just a function of research
methodology, but may accrue as a result of factors inherent in
the W-B itself. For instance, Cohen (26) finds that the W-B
subtests do not always appear to measure the same factors in
different types of patients. Marks (80) criticized Wechsler’s
qualitative pattern approach to scatter analysis, since it
permits of a wide range of subtest variation coding within a
given diagnostic category, such as from ++ to -. This tends to
“load the data” in terms of a lack of reliability and validity
of Wechsler’s diagnostic pattern. It appears “safe” to review
research on pattern analysis in the different diagnostic
categories now that the “warning alarm” has been sounded.

Schizophrenic patterning. Harper (55) applied Fisher’s
discriminant function to pattern analysis. In general, Harper
felt that the pattern of subtests differentiated his group of
schizophrenics from his normals, and further, that the
subtypes, paranoid, hebephrenic, and catatonic, were
significantly differentiated from the rest of the
schizophrenics and the normals.

Harper wrote, as regards the regression equation he had
formulated, “The extent to which the regression weights would
... is not known.” Cross validation was needed. So, on a
suggestion by Solomon Machover, Reich (99) performed this test.
He found that the Harper equation identified 65 per cent of a
new schizophrenic population at Kings County Hospital as
schizophrenic, concluding that, in general, individual
differentiation was poor. On a further suggestion by Machover,
Frank (42) tested the effectiveness of Harper’s equation when
applied to a heterogeneous group of psychotics (excluding
schizophrenics) at Kings County Hospital. He found that the
Harper equation misidentified 47 per cent of this population as
schizophrenic, thereby questioning the utility of the formula.
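
The arithmetic behind that judgment can be made explicit with a short sketch. It takes the 65 per cent and 47 per cent figures from the two cross-validation studies above and, purely as an illustrative assumption, combines them as if both groups had been tested together in equal numbers; the function name is invented.

```python
def balanced_accuracy(hit_rate, false_positive_rate):
    """Mean of the hit rate and the correct-rejection rate.

    With two equally weighted groups, 0.50 is what a coin toss
    would achieve.
    """
    correct_rejection_rate = 1.0 - false_positive_rate
    return (hit_rate + correct_rejection_rate) / 2.0


# 65 per cent of schizophrenics called schizophrenic (Reich),
# 47 per cent of other psychotics called schizophrenic (Frank).
print(round(balanced_accuracy(0.65, 0.47), 2))
```

Under that assumption the equation performs only modestly above the chance level of .50, which is consistent with the reviewers’ doubts about its utility for individual diagnosis.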

Rogers (100) compared schizophrenics and neurotics on fifteen
signs postulated by Rabin, Rapaport, and Wechsler, and found
only eight of them tenable. However, Rogers grouped the
various subtypes of schizophrenia under a single heading. On
the other hand, on the basis of a comparison of 50 neurotics
and 50 schizophrenics, little agreement was found between
Wechsler pattern and psychiatric diagnosis. Whiteman and
Whiteman (137) had some success using factor analysis on the
performance of 50 schizophrenics and 50 police applicants,
finding that clusters entitled “reality perception” and
“psychomotor efficiency” significantly differentiated the
groups. McNeal (77) checked Wechsler’s four signs of
schizophrenia on equated normal and schizophrenic groups for
340 male veterans of World War II. “Only one of Wechsler’s
signs discriminates significantly between the groups ... but it
identifies schizophrenics as more normal than normals.” What
appears to be a very interesting and thought-provoking finding
is Monroe’s (88) demonstration that schizophrenics of average
or above average intelligence did not show more scatter than
neurotics, and that extreme scatter was characteristic of
schizophrenics with low obtained IQ.

Patterning in the affective disorders. The writers could find
but one investigation in this area during the entire five-year
period, an apparent sign of the need for more research.
Waldfogel and Guy (126) found differences between
manic-depressives and controls, but none between depressive or
manic states of the manic-depressive psychosis. However,
further analysis revealed that the parameter of age was
contributing an unknown amount of variance to the pattern,
hence depreciating the meaningfulness of the results.

Patterning in neurosis. Monroe (88), using a factorial design
to test the effect of levels of adjustment, intelligence, and
locality, found little differentiation between neurotics and
well-adjusted normals when the interaction of these effects was
deleted from the between-group variance. Schillo (103) found
no difference between neurotics and normals on disparity
between verbal and performance IQ, performance on individual
tests, or rank-order correlation of test performance, although
his neurotics showed greater variability from their own group
mean than the normals. Schillo’s subjects, though, included
individuals subject to anxiety reaction, obsessive-compulsive
reaction, and mixed neurosis; and no data are offered as to the
possible differences within this group as related to the
subtypes.

Patterning in mental deficiency. Alderdice and Butler (3) seem
to be the only investigators interested in identifying a mental
defective pattern. They present thoughtful approaches to
pattern analysis and compare their findings with those of
previous workers. Their pattern for mental deficiency, while
not a sharply delineated one, is similar to those previously
reported. It is most at variance with the pattern initially
proposed by Wechsler (131). Little work seems to have been
done in terms of an understanding of the performance of the
mentally defective subject on the subtests of the W-B. One
investigation, by McPherson and Fisch (78), attempts to
understand the defective’s poor performance on S. Using
approximately 30 subjects with an IQ range of from 55 to 85,
they found that 66 per cent of the failures were due to
evasiveness, what McPherson and Fisch term “learned
negativism.” The authors suggest that as a result of such
behavior, testing may not yield an exact measure of the
defective’s ability.

Patterning in organicity. Collins (27), investigating the
effect of epileptic involvement, found that only those
diagnosed with organicity demonstrated deterioration. Further
analysis revealed the superiority of the performance of the
endogenous to the exogenous group. A comparison of Collins’
400 outpatient epileptics with the protocols of
institutionalized groups of epileptics, psychotics, neurotics,
and psychopaths, presented in previous research, showed
considerable differences. In the analysis of cerebral
arteriosclerotic patients compared with 26 patients with
“other” forms of cerebral pathology matched for age, education,
and IQ, Oppenheim (91) found no significant differences on
subtests or in total variability of performance.

In one investigation [...] Brown (35) found that [...] inverse
relationship between [...] Hughes [...] signs on the [...] and
the [...] derived [...] intracranial pathology. Diers [...]
analyzed the protocols [...] diagnosed [...]. When [...]
controls, the sclerotic group demonstrated [...] lower memory
span [...] superior PC; however, a Pearson r of .17 (not
significant) was obtained between Wechsler’s organic signs and
actual organicity. [...] offer the following [...] their
results: [...] index of [...] Wechsler-Bellevue [...] an
indicator of existing [...] damage in multiple sclerosis [...]
cortical pathologic [...] the population with [...] comprising
this study.

Patterning of the sociopath. The patterning of the sociopath (psychopath, delinquent) has come under considerable scrutiny since the last review. Bernstein and Corsini (12) tested the validity of Wechsler's assumption that performance scores are higher than verbal scores, which they could not reject. By the method of forward and backward presentation of the subtests, they rejected the assumption that this was a spurious difference due only to the fact that the performance part of the test is administered late in the examination period, permitting the sociopath to adjust more adequately.

Graham (49) found that there was a significant relationship between performance on the W-B and school performance, and that the scattergram of Wechsler's adolescent psychopath closely approximates that of the unsuccessful reader. Graham makes the appealing suggestion, "It does not seem unreasonable to assume that the profile (Wechsler's) ... psychopath is typical of ... retarded youth ... his moral qualities."

1 125) tested the
the W-B between
il tellige: ce,


socioeconom ackground) of de- 
t virls, and 
eher scores On Dand 
juent than for the 
about all they 


| . rt ’ nrialse . 
mquent an HG ll) cl 


Srynte 
pO 


I for 


delingu 


dl none, 


te the two groups. 
mpared the subtest 
matched group of 
| 


IwhoOsed 


ound 


as psve hopaths 


1 “‘nonpsychopathic’’ inmates, and 


Wechsler's standardization group of psychopaths. Through a chi-square and f-test of significance, the results indicated that there was no characteristic subtest pattern for the psychopath, and, further, that there were no differences between the "psychopathic" and the "normal" prison inmate, and between these two and Wechsler's psychopathic population.

In a similar study, Clark and Moore (22) had previously attempted to differentiate the test patterns of subclassifications of military offenders (no NP disorder, immaturity reaction, and pathological personality types, presumably psychopaths) but with no success.

If one considers unauthorized discharge from a medical hospital as asocial or antisocial behavior, Thurston and Calden's (123) study on the irregular discharge of tuberculosis patients becomes pertinent here. However, the W-B could not differentiate the "AWOL" group from those who remained in the hospital, either in terms of IQ or subtest patterning.

Summary. Our somewhat jaundiced eye continues to reject the assumption of unique subtest performances by schizophrenics. The results are at best inconsistent, but their very inconsistency may testify to the erroneous assumption that randomization will control for the effect of the parameters upon subtest performance. But the reader of such researches is left in the predicament of being uncertain whether to attribute the negative results to the methodology or to the hypothesis. For groups of defectives mental patterning seems better established than for schizophrenics. The search for patterns of performance characteristic of the organic brain-damaged, the manic depressives, and neurotics has been rather fruitless but, as with the schizophrenics, sampling methods have left much to be desired.

Researches with the sociopath indicate some points of agreement but there is still much disagreement. The pattern proposed by Wechsler, "adolescent psychopath," is coming to be evaluated in terms of the subjects' academic training, motivation, and general background of experience.

The over-all impression gained from the pattern studies reviewed is that the findings are inconclusive. Patterns suitable for clinical use will not be forthcoming until methodological improvements appear. Further advances in typology can do much to establish suitable diagnostic criteria for which patterns can legitimately be sought.


GENERAL SUMMARY

The past five years exhibit two major changes in the research trends with the W-B. In the first place, there has been a realignment of the general nature of the studies. Research with psychiatric syndromes has been reduced, whereas a larger proportion of the studies reviewed deal with the W-B as a test of general intelligence, investigating its reliability, validity, rationale, etc. Secondly, it has been noted throughout the review that the number of well-controlled and statistically sophisticated studies has markedly increased.

One rather new and somewhat unexpected trend is the demonstration of sex differences. It points up an additional uncontrolled factor in previous studies and has important implications for the use of test patterns, scatter, and such. Of course, the need for a larger, more representative sample for the demonstration of sex differences, or their absence, at different levels of intelligence still remains. It may be added that another factor that needs control in research is that of socioeconomic level. Social class has been found to have a bearing on intelligence test achievement. This remains comparatively unexplored with the W-B. Moreover, it may have an important relationship to the Verbal-Performance discrepancies noted in psychopaths, delinquents, and others whose status in such classifications may be related to socioeconomic level as well.

When one looks at the work done with various psychiatric populations by means of scatter and patterning methods, one can readily conclude that "nothing new has been added" either in methodology or in definite findings. The results are still "inconclusive." It may be wondered whether it should not be said that there are no positive results instead of leaving an open crack in the door, implicitly indicating a still-tenacious clinging to an overworked hypothesis.

Maybe, with the creation of a newly standardized instrument, similar in structure to the W-B, but not suffering from the numerous weaknesses which have been pointed up in this review and in previous ones, more fruitful research with pattern analysis will be forthcoming. However, such research cannot continue to close its eyes to the weakness of the criterion itself, psychiatric diagnostic classification.
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THE INTERMEDIARY 
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What 


we are here calling interpersonal perception has also been called empathy, sensitivity, understanding, diagnosis, social perception, etc. In recent years, the following procedure has been much used in the investigation of interpersonal perception: A person, or a group, called the Other, provides a self-description, as by filling out a personality inventory or rating scale. Another person, called the Judge, predicts the Other's self-description, filling out the same inventory or rating scale with the responses he predicts the Other would give. The Judge's accuracy score is then the closeness of his predictions to the actual responses of the Other (e.g., 8, 14, ...). ... any psychologist ... this operation has qualified almost by definition as a measure of the Judge's ability to empathize with the Other. It does seem, at first glance, that the Judge should have to "... into" the Other's self concept, feelings, attitudes, and the like, if he is to predict accurately the Other's self-descriptions. This apparently straightforward procedure does not, however, yield ... wholesome data. Rather, beneath its surface, we find an intricate complex of processes and components.


A draft of this paper was presented at a symposium on interpersonal perception during the APA meetings, September, 1954. This investigation was supported by a research grant (M-650) from the Institute of Mental Health, National Institutes of Health, Public Health Service.
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se sets, such as acquiescence, in 
Scoring the Judge's pre- 
ons and the Other's self-descrip- 


taking. 


s with such an acquiescence key 
vs that the ; 
luenced by l 
t acquiescence | nceies of 


ve and the Other. 


wecuracy score can be 
between 
the 
If they have 


iv the a qui- 


iT = 
COLNCITICHCE 


in usin 
sponse contin- 
hive high ac 


a markedly 


he favorability of 
ind of the Others’ 
psv- 
oncerning the 


m of t} e two 


en bodice s 


iVvora 
r more 


in favor 


will correlate positively wi 


| 


ese 

> accuracy - nsofar as the 
Other described himselt 
», favorabilits 


favorably. 
len ( key Ss can be used 


to expose an otherwise concealed aspect of interpersonal perception. Then accuracy may result not from perceiving cues and inferring correlates, from having intuition, or from Einfühlung. Rather, it may result from a fortuitous concomitance between the Judge's favorable impression concerning an Other or toward a group of which the Other is a member, and the Other's tendency to describe himself favorably.

3. An adjustment key was used (19) when the predictions and self-descriptions were obtained on the Bell Adjustment Inventory. This made possible measures of the Judges' attribution of adjustment and the Others' self-described adjustment. Such measures should have much in common with those yielded by the favorability key described above.


Keys Obtained by Varying Instructions to Judges

4. Suppose we obtain the Judge's self-description on the same items as those on which he predicted the Other's response (2, 9, 10, 17, 21). Comparing this protocol with the Other's self-description yields a score that has been called "real similarity"; comparing the Judge's self-description with his predictions of the Other yields a score for the Judge's assumed similarity between the Other and himself. This procedure can be used to break down any accuracy score into two components, "warranted assumed similarity" and "warranted assumed dissimilarity." Judges are accurate in some cases because they are highly similar to the Other and assume high similarity; in other instances Judges are accurate because they are dissimilar and assume little similarity.
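The breakdown described in key 4 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the items are taken to be two-choice items coded 0 or 1, and the function name `similarity_breakdown` and the dictionary keys are hypothetical labels, not terms or procedures from the studies cited.

```python
from typing import Dict, Sequence


def similarity_breakdown(
    judge_self: Sequence[int],
    prediction: Sequence[int],
    other_self: Sequence[int],
) -> Dict[str, int]:
    """Count item-level agreements for two-choice (0/1) items.

    Assumption: every protocol answers the same items in the same order.
    """
    if not (len(judge_self) == len(prediction) == len(other_self)):
        raise ValueError("all three protocols must cover the same items")

    counts = {
        "accuracy": 0,                         # prediction matches Other's self-description
        "real_similarity": 0,                  # Judge's self-description matches Other's
        "assumed_similarity": 0,               # prediction matches Judge's self-description
        "warranted_assumed_similarity": 0,     # accurate and assumed similar
        "warranted_assumed_dissimilarity": 0,  # accurate and assumed dissimilar
    }
    for js, p, os_ in zip(judge_self, prediction, other_self):
        hit = p == os_
        counts["accuracy"] += int(hit)
        counts["real_similarity"] += int(js == os_)
        counts["assumed_similarity"] += int(p == js)
        if hit and p == js:
            counts["warranted_assumed_similarity"] += 1
        elif hit:
            counts["warranted_assumed_dissimilarity"] += 1
    return counts


example = similarity_breakdown(
    judge_self=[1, 1, 0, 0, 1, 0],
    prediction=[1, 0, 0, 1, 1, 1],
    other_self=[1, 0, 1, 1, 1, 0],
)
```

In this sketch every accurate prediction falls into exactly one of the two "warranted" counts, so the two components always sum to the accuracy score.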

5. When the Judge predicts the responses of the typical member of the subcultural group to which the Other belongs, we can obtain a stereotype key (10). Applied to predictions, such a key yields a "rigidity" score for the Judge, reflecting the degree to which he tends to see the Others as typical, and a similarity-to-stereotype score for the Other. Accuracy occurs if the Judge follows his stereotype when the Other is actually similar to it. For a group of Others who strongly resemble a Judge's stereotype, we get high accuracy when the Judge is "rigid" in the sense of thinking that the Others are all like his stereotype.

6. Manifest stimulus value, the impression that the Judge forms as to the actual personality of the Other, can be distinguished from the Judge's prediction of how the Other will describe himself. For example, an effeminate boy may have such a manifest stimulus value that we would describe him as really preferring to play girls' games. But we would predict that he would choose the boys' games on an interest inventory because he could not be expected to admit to interests so out of line with his sex role. Suppose we ask a group of Judges to describe a boy as they think he really is and not necessarily as they predict he would respond to a personality inventory. The resulting "manifest stimulus value" key is made up of the modal Judge's descriptions of the Other. Applied to the Other's self-description, it yields an "insight-and-frankness" score. Applied to the predictions of Judges, it yields an "attribution-of-insight-and-frankness" score. Now, assuming correct perception of the manifest stimulus value, accuracy can be fractionated into the Judge's warranted attribution-of-insight-and-frankness and warranted attribution-of-self-deception ...


Keys 
Prediction 


- vy 
‘. abit 


two forms: the average prediction of Judges in predicting the response of a single Other and, for each item, the average prediction of a single Judge for several Others. The first of these can break down the accuracy score into typicality-of-prediction and predictability-of-the-Other's-self-description. When Judges make highly typical predictions for highly predictable Others, they are accurate; when they make atypical predictions for Others who are missed by the majority of the Judges, they will also be accurate. The second kind of modal prediction key can be used to obtain a measure of the Judge's "implicit stereotype." It is involved in measures of stereotype accuracy (5).
8. A key embodying the modal self-description of the Others (15), i.e., the manner in which the majority of Others describe themselves, yields a score for the similarity of the Judge's predictions to the modal self-description and a score for the Other's typicality. Accuracy then becomes a function of these two scores. Talland (23) also averaged self-descriptions but used them only as the key for scoring the individual member's accuracy in evaluating group opinion.


EVALUATING AND USING INTERMEDIARY KEYS

Evaluating Intermediary Keys

How can we choose from the many possibilities those intermediary keys that have genuine value in the analysis of interpersonal perception? Three bases for evaluating intermediary keys are as follows:

1. The internal consistency over items of the score obtained by applying the intermediary key to the Judges' predictions. Unless this internal consistency is substantial, the intermediary score cannot be considered psychologically characteristic of the Judge.

2. Degree of confounding of two or more possible intermediary keys. For example, the acquiescent choices on an inventory (e.g., the California F scale, the Minnesota Teacher Attitude Inventory) may also tend to be the unfavorable choices. If so, we can determine which of the two is operating only by revising our inventory. When the confounding is eliminated, scores obtained with one of the keys may lose their internal consistency.

3. Degree to which the intermediary key yields scores logically attributable to genuine social interaction between the Judge and the Other. In some situations, a given intermediary score may reflect a characteristic of the Judge which existed prior to his observation of the Other. In other situations, the same intermediary score may reflect genuine influences of observation or interaction, e.g., a favorable reaction to a particular Other. We should distinguish between such post- and preinteraction intermediary scores. Evidence for the preinteraction character of an intermediary score is obtained from the existence of generality of the score over Others, especially if the Others are heterogeneous with respect to the variable(s). When the intermediary scores obtained on Judges' predictions for heterogeneous Others correlate highly among themselves, the predictions are probably determined autistically rather than determined by evidence concerning the Others.
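The first basis, internal consistency over items, could be checked with a split-half coefficient. The sketch below is one way to do so and rests on assumptions not stated in the text: each Judge's predictions are taken to have been scored item by item against the key (1 if the prediction matches the keyed response, 0 otherwise), the halves are the odd- and even-numbered items, and the correction is the Spearman-Brown formula. The function names are hypothetical.

```python
from math import sqrt
from typing import List, Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Product-moment correlation between two equally long score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a half-score has no variance across Judges")
    return sxy / sqrt(sxx * syy)


def corrected_split_half(item_scores: List[Sequence[int]]) -> float:
    """Odd-even split-half reliability with the Spearman-Brown correction.

    item_scores[j][i] is Judge j's keyed score (0 or 1) on item i.
    """
    odd_half = [sum(row[0::2]) for row in item_scores]   # items 1, 3, 5, ...
    even_half = [sum(row[1::2]) for row in item_scores]  # items 2, 4, 6, ...
    r = pearson_r(odd_half, even_half)
    return 2 * r / (1 + r)


# Four hypothetical Judges, four items each.
reliability = corrected_split_half([
    [1, 1, 1, 1],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [1, 0, 1, 0],
])
```

A low coefficient here would suggest, in the terms used above, that the intermediary score is not psychologically characteristic of the Judge.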


Using Intermediary Keys

These considerations can be used to examine possible intermediary keys both empirically and logically. Those which prove upon examination to be relevant to the particular problem at hand should be used in the analysis of interpersonal perception. Thus, use of intermediary keys may reveal that the measures obtained in a given situation are highly loaded with conceptually irrelevant variance.

If so, such irrelevant influences on interpersonal perception measures can often be reduced by lowering the internal consistency of the intermediary score obtained on the Judge's predictions. For example, the information available to the Judge can be made more relevant to the predictions requested. Then the Judge may have less tendency to fall back upon autistic response sets. Although assumed similarity scores, for example, would still be obtainable, they would no longer be general over items or Others. An item format in which the response alternatives change from one item to the next will reduce the likelihood of a positional response set (4) in the Judges' predictions, e.g., reliable individual differences in tendency to choose the first of two alternatives. Similarly, a forced-choice format, in which the favorability of various response choices has been equated within each item, will reduce the reliable favorability-of-prediction variance in the Judges' predictions.

Another method of minimizing the influence of general response dispositions is to give credit for accuracy only when the Judge correctly differentiates in his predictions between two Others (or between an Other and himself) who answered an item differently. A "refined empathy" score derived by subtracting the assumed similarity score from the accuracy score has been suggested (12). Partial correlation has also been considered, i.e., basing the accuracy score on the partial correlation of the Judge's prediction with the Other's self-description, holding real similarity constant. Both of these methods have proved on further examination to be inadequate (3, 11). We have scored Judges' predictions on only those items on which two Others responded differently, giving credit for accuracy only when the Judge correctly predicted the difference. Corrected split-half reliabilities for this score for two groups of Judges each predicting for a different pair of Others were .65 and .51.
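The differential scoring just described might be sketched as follows. The sketch assumes two-choice items coded 0 or 1, so that "correctly predicting the difference" amounts to predicting both Others' responses correctly on an item where they differ; the function name `differential_accuracy` is a hypothetical label and the scoring details of the original study may have differed.

```python
from typing import Sequence, Tuple


def differential_accuracy(
    pred_a: Sequence[int],
    pred_b: Sequence[int],
    self_a: Sequence[int],
    self_b: Sequence[int],
) -> Tuple[int, int]:
    """Return (credits, items_scored) for one Judge and a pair of Others.

    Only items on which the two Others answered differently are scored,
    and credit is given only when both predictions are correct.
    """
    if not (len(pred_a) == len(pred_b) == len(self_a) == len(self_b)):
        raise ValueError("all protocols must cover the same items")

    credits = 0
    items_scored = 0
    for pa, pb, sa, sb in zip(pred_a, pred_b, self_a, self_b):
        if sa == sb:
            continue  # the Others agree, so the item cannot show differentiation
        items_scored += 1
        if pa == sa and pb == sb:
            credits += 1
    return credits, items_scored


result = differential_accuracy(
    pred_a=[1, 0, 0, 1, 1],
    pred_b=[0, 1, 1, 0, 1],
    self_a=[1, 0, 1, 1, 0],
    self_b=[0, 0, 1, 0, 1],
)
```

Because a Judge who gives the same prediction for both Others can never earn credit on a scored item, a general response set applied alike to both Others contributes nothing to this score.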


‘ 


How Intermediary Scores Account for Nongenerality of Accuracy

Intermediary keys have shed light on the failure to find generality over Others in measures of accuracy of interpersonal perception. Typically, accuracy in judging or predicting for one Other has correlated less than .35 with accuracy in judging or predicting for a different Other (e.g., 1, 2, 3, 7, 10, 13, 16). At the same time, generality over items in accuracy of predicting for a single Other has often been substantial, i.e., .7 and higher.

How can we account for such results? By use of intermediary keys, we have demonstrated (22) that standard Others, i.e., fifth-grade boys and girls presented to Judges by means of sound films, did serve as discriminable social stimuli. Specifically, a favorability key showed that the Judges made consistently and significantly more favorable predictions for some of the children than for others. The corrected split-half reliabilities of the favorability-of-prediction scores ranged from .81 to .90, with a median of .86. Now the accuracy of a Judge in predicting the responses of a particular child depends in part on the congruence between the favorability of his perception of the child and the favorability of the child's self-description. Thus, the rank order in magnitude of four median correlations between favorability and accuracy was exactly the same as the rank order in favorability of the four children's self-descriptions.

It is instructive to examine the structure of the relationship between the favorability of a single Judge's predictions and of the child's self-description. We can form a triad from (a) the prediction protocol of the Judge, (b) the self-description protocol of the Other, and (c) the favorability key. Matching each of these against the other yields three scores: (ab) accuracy, (ac) favorability-of-prediction, and (bc) favorability-of-self-description. When any two of these scores or proportions are in any degree fixed, the third is partially determined. Thus, suppose that, from observing the Other, the Judge forms an over-all impression such that he will predict favorably concerning the Other on 75 per cent of a set of two-choice items. Further, suppose that the Other describes himself favorably on 75 per cent of the items. In this case, the "chance" level of accuracy with two-choice items will no longer be 50 per cent; rather, if no determinants of accuracy were operating other than the favorability "sets," chance success on a set of favorability-loaded items would be 62.5 per cent (.75 × .75 + .25 × .25). Any key may be substituted for the favorability key without altering the logic of the triadic relationship.
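The 62.5 per cent figure can be reproduced with a short calculation. The sketch assumes, as the passage does, that nothing but the two favorability "sets" is operating, which is treated here as statistical independence between the Judge's favorable predictions and the Other's favorable self-descriptions across items; the function name `chance_accuracy` is a hypothetical label.

```python
def chance_accuracy(p_favorable_prediction: float, p_favorable_self: float) -> float:
    """Expected proportion of hits on two-choice items from favorability sets alone.

    A hit occurs when both responses are favorable or both are unfavorable;
    independence between the two protocols is assumed.
    """
    for p in (p_favorable_prediction, p_favorable_self):
        if not 0.0 <= p <= 1.0:
            raise ValueError("proportions must lie between 0 and 1")
    both_favorable = p_favorable_prediction * p_favorable_self
    both_unfavorable = (1 - p_favorable_prediction) * (1 - p_favorable_self)
    return both_favorable + both_unfavorable


expected = chance_accuracy(0.75, 0.75)
```

With both proportions at .50 the function returns the familiar 50 per cent, and the expected hit rate rises as the two proportions move away from .50 in the same direction.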

In some of our illustrations, as noted above, the relationship between the "intermediary key" and the prediction and self-description protocols may be entirely or partially determined by actual social perception of the Other by the Judge. The two scores yielded by the modal prediction key, for example, are at least partially determined by genuine social interaction between the Judges and the Other.

But other keys may yield two scores of which one may be fixed before the Judges see the Other. The favorability key discussed earlier illustrates such a possibility. That is, the favorability of the Other's self-description is determined before the Judge observes the Other. When the intermediary key is the Judge's prior self-description, we have another situation where one of the scores (real similarity) is not influenced by the interaction with this Other.
“s assumed similarity to 
av also be partially de- 


{ 1 ' 
ball Oils 


extrinsic 
] idee td 
20) that the 


similarity 


on betweer 


may 
rs and over var- 
nly to the extent 
ty score of 

Out ae tually 

n we con- 
interaction 

Lo develop 
neither ot 
prediction 

iv be inftlu- 
ve-Other interac- 
the Cases, as WO ild be true 


ilaritv aif it were a 
ition ceneral over all 
all Others, the scores may 


1 } 
1, ‘ 
Nnowmrical s 


CANCE al- 
rehect i 
other 
response 
iCCUFAaAC\ 


ny soci i] 


consid- 

than mathematical 
ius, the tendeng V of the 

he Other 

when 


to choose a as 
in doubt as to the 
ternative in a_ two-choice 
av vield a triad of this quasi- 
tlere, of course, 


intermediary key consists of all 


hemiatical form. 


u responses. It is hard to imagine how 
such tendencies might be related to 
social interaction between Judge and 
Other, but it is readily apparent that 
they would influence the accuracy of 
the Judges’ predictions. 

How does this discussion of intermediary keys bear on the problem of generality of accuracy in perceiving standard persons? We feel that the finding of reliability of accuracy for one Other without generality over Others can be explained by the fortuitous occurrence of these triad relationships. Our methods of measuring accuracy often make the operation of the various global judgments, regarding favorability and typicality, and response dispositions, such as the tendency to assume similarity or to respond a, very influential in the determination of accuracy; hence we obtain accuracy scores that are reliable over items for a single Other. But we know that a conclusive demonstration of generality of the ability to predict the responses of Others requires that the Others be dissimilar. By our selection of dissimilar Others we make it inevitable that the combination of the Judges' attributes and Others' attributes will be different for each Other. Thus we prejudice the results against a spurious kind of generality due to Judge-Other response contingencies, and indeed find relatively little ability on the part of the Judges to make genuine differentiations.

Crow (7) has demonstrated that generality over Others of the Judges' response sets can account for the generality over Others of predictive accuracy. That is, in Crow's data, accuracy seemed to be general over Others to the degree that (a) those Others were homogeneous and (b) the Judges had response sets that were general over the Others.


THE INTERMEDIARY KEY IN RELATION TO ANALYSIS OF VARIANCE

The intermediary key explicitly 
generalizes many methods of analyz- 
ing interpersonal perception data 
that have already been used. One 
other major approach to analyzing 
interpersonal perception has been 
developed which has not, at first 
glance, involved such keys. This is 
the analysis of variance design de- 
veloped by Cronbach (5). 

The relationship between the intermediary key and analysis of variance approach can be seen by referring to the k-space model developed by Osgood and Suci (18) and by Cronbach and Gleser (6). In this model, the Judge's predictions and the Other's self-descriptions are each represented as a point in a k-space, where k is the number of items or scales. Accuracy is then measured inversely as the distance, D, between the two points. D equals the square root of the sum over items or scales of the squares of the differences between predictions and self-descriptions. In the analysis of variance approach, accuracy is dissected by showing that it consists of differences between Judges and Others in response set, in scatter over the items, in scatter over Others, and so on.
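The distance measure D defined in this paragraph translates directly into code. The sketch below follows the verbal definition only; the function name `distance_d` is a hypothetical label, and the responses are assumed to be numeric ratings on the same k items or scales.

```python
from math import sqrt
from typing import Sequence


def distance_d(predictions: Sequence[float], self_descriptions: Sequence[float]) -> float:
    """Distance between two protocols treated as points in a k-space.

    D is the square root of the sum, over the k items or scales, of the
    squared differences between prediction and self-description.
    """
    if len(predictions) != len(self_descriptions):
        raise ValueError("both protocols must cover the same k items")
    return sqrt(sum((p - s) ** 2 for p, s in zip(predictions, self_descriptions)))


d = distance_d([3, 0], [0, 4])
```

Since accuracy is measured inversely as D, a Judge whose predictions coincide with the Other's self-descriptions on every item obtains D = 0.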
Each of the arithmetic means introduced as a reference point from which to compute a component of accuracy analogous to a component of variance may be considered an intermediary key. For example, the mean self-description of all Others on all items, from which the response set component of accuracy is computed, may be considered an intermediary key interposed between predictions and self-descriptions. In terms of Cronbach's analysis, this key could be used to score the average of the Judge's predictions over all items and all Others, yielding what is then called the elevation component of accuracy. As a second example, the differences between the mean self-description of one Other on all items and the mean self-description of all Others on all items can be used as an intermediary key, this time applicable to the predictions for a particular Other. Applied to the corresponding differences between the Judge's average prediction for all Others and for the particular Other, this key would yield measures of what Cronbach called differential elevation. A final example is a key consisting of the mean response of all Others on each item; applied to the mean prediction of a Judge for all Others on each item, this would yield a measure of the Judge's stereotype accuracy.
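The three mean-based keys described above can be illustrated with a small sketch. It is not a reproduction of Cronbach's formulas: it simply computes, for one Judge, the discrepancy between predictions and self-descriptions at each of the three reference points named in the text. The function name `mean_key_discrepancies`, the dictionary keys, and the choice to report signed differences are all assumptions made for illustration.

```python
from typing import Dict, List, Sequence, Union


def _mean(values: Sequence[float]) -> float:
    return sum(values) / len(values)


def mean_key_discrepancies(
    predictions: List[Sequence[float]],
    self_descriptions: List[Sequence[float]],
) -> Dict[str, Union[float, List[float]]]:
    """Compare one Judge's predictions with the Others' self-descriptions.

    predictions[o][i] and self_descriptions[o][i] refer to Other o, item i.
    """
    n_items = len(predictions[0])

    def grand(m):   # mean over all Others and all items
        return _mean([v for row in m for v in row])

    def per_other(m):  # each Other's mean over items, as a deviation from the grand mean
        g = grand(m)
        return [_mean(row) - g for row in m]

    def per_item(m):   # each item's mean over Others, as a deviation from the grand mean
        g = grand(m)
        return [_mean([row[i] for row in m]) - g for i in range(n_items)]

    return {
        # reference point: mean of all Others on all items
        "elevation": grand(predictions) - grand(self_descriptions),
        # reference point: each Other's mean relative to the mean of all Others
        "differential_elevation": [
            p - s for p, s in zip(per_other(predictions), per_other(self_descriptions))
        ],
        # reference point: mean response of all Others on each item
        "stereotype": [
            p - s for p, s in zip(per_item(predictions), per_item(self_descriptions))
        ],
    }


components = mean_key_discrepancies(
    predictions=[[3, 5], [1, 3]],
    self_descriptions=[[2, 4], [2, 4]],
)
```

In this hypothetical example the Judge matches the Others' over-all level and item profile but separates the two Others more than their self-descriptions warrant, which shows up only in the differential elevation discrepancies.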
Each intermediary key represents a reference point around which a component of accuracy variance could be analyzed. Not only means over items, over Others, over Judges, or over any two of these may be used to dissect accuracy. Rather, the intermediary key formulation shows that psychologically or logically defined reference points can be used; for example, a reference point may be constructed consisting of all "favorable" or all "adjusted" responses. Accuracy can then be investigated as a function of the concomitance between predictions and self-descriptions along the "favorability" and "adjustment" dimensions. Which of the many possibilities the analyst of variance or the user of intermediary keys chooses will depend upon his purposes and insight.


SUMMARY 


The intermediary key consists of a protocol, e.g., a set of responses to a questionnaire or rating scale. When interposed between the Judge's predictions and the Other's self-descriptions, the intermediary key sheds light on the processes involved in interpersonal perception. Eight illustrative intermediary keys, drawn from recent investigations, are described. Among the considerations that may be used in evaluating intermediary keys are the internal consistency of their scores, their degree of confounding with other keys, and the degree to which the processes revealed by such keys may be attributed to social interaction between Judge and Other, as against autistic response sets or mathematical artifacts.

Intermediary keys have been useful in showing how accuracy in predicting Others' responses can be general over items but not over Others. Similarly, they have revealed how the processes affecting accuracy in interpersonal perception may or may not be attributable to genuine perception of the Other by the Judge.

The intermediary key approach is related to the analysis of variance approach in analyzing interpersonal perception because the reference points in both approaches may be considered points in a k-space defined by the k items or scales of a questionnaire or rating scale. The intermediary key can be used to define psychologically as well as mathematically meaningful reference points.